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1.  INTRODUCTION 


This  Semiannual  Technical  Report  covers  the  period  from  January  1 
through  June  30,  1973.  Our  efforts  have  been  in  four  major  areas:  advanced 
packet-switching  systems,  including  multiaccess  satellite  and  packet  radio 
systems;  computer  communication  network  design;  multiple  resource  multiple 
access  computer  systems  models;  and  measurements  on  the  ARPANET  itself.  In 
addition,  we  have  been  involved  with  some  network  protocol  studies  and  some 
controlled  access  and  security  questions.  The  results  of  that  research 
have  been  documented  and  are  listed  in  Section  2  following. 

In  this  report  we  have  attached  three  of  our  publications  which  have 
appeared  in  the  professional  literature;  we  do  not  include  material  from 
other  areas  of  activity,  mentioned  above, in  this  document,  and  the  reader 
is  referred  to  the  referenced  publications  themselves. 

The  first  paper  we  include  in  Section  3  below  has  to  do  with  "Packet 
Switching  in  a  Slotted  Satellite  Channel,"  by  L.  Kleinrock  and  S.  S.  Lam 
(AFIPS  Conference  Proceedings,  1973  National  Computer  Conference  and  Exposi¬ 
tion,  June  4-8,  1973,  toew  York,  N  V.,  pp.  703-710).  In  this  paper  the  basic 
behavior  of  throughput  and  delay  were  studied  for  some  multiaccess  schemes 
for  satellite  communications  in  a  packet  switching  network.  These  schemes 
permit  a  number  of  earth  stations  to  simultaneously  access  the  capacity  of 
a  shared  satellite  channel,  thereby  extending  the  multiplexing  principles 
of  packet  switching  to  satellites.  Two  related  papers  presented  by  others 
at  the  NCC  session  on  satellites  were  "Dynamic  Allocation  of  Satellite 
Capacity  through  Packet  Reservation,"  by  L.  G.  Roberts,  and  "Packet  Switching 
with  Satellites,"  by  N.  Abramson. 

A  second  paper  included  below  and  entitled  "The  Flow  Deviation  Method: 

An  Approach  to  Store- and-Forward  Communication  Network  Design,"  by  L.  Fratta, 
M.  Gerla,  and  L.  Kleinrock  (Networks ,  3:97-133,  1973),  summarizes  some  of 
the  major  concepts  of  the  flow  deviation  method  for  computer  network  design; 
this  method  was  discussed  in  the  previous  Semiannual  Technical  Report 
(December  31,  1972),  but  the  current  paper  delves  into  the  foundations  more 
deeply.  The  flow  deviation  method  leads  to  an  efficient  design  procedure 
for  networks.  This  paper  is  included  as  Section  4. 

The  fifth  section  contains  the  paper  "On  Non-Blocking  Switching  Net¬ 
works,"  by  D.  G.  Cantor  (Networks ,  1:367-377,  1972).  The  problem  discussed 
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is  that  of  finding  switching  networks  which  are  guaranteed  to  be  non- 
blockingjjn  that  any  idle  input  terminal  may  always  be  connected  to  any  idle 
output  terminal.  This  is  a  basic  problem  in  circuit  switching  and  is  the 
starting  point  for  some  of  onr  studies  comparing  circuit  switching  to  message 
switching. 
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Packet-switching  in  a  slotted  satellite  channel* 

by  LEONARD  KLEINROCK  and  SIMON  S.  LAM 

Univenity  of  California 
Lon  Anfdw,  California 


INTRODUCTION 

c 

Imagine  that  two  users  require  the  use  of  a  eommunieation 
channel.  The  classical  approach  to  satisfying  this  require¬ 
ment  is  to  provide  a  channel  for  their  use  so  long  as  fiat  need 
continues  (and  to  charge  them  for  the  full  cost  of  this  chan¬ 
nel).  It  has  long  been  rceognixed  that  such  allocation  of 
scarce  eommunieation  resources  is  extremely  wasteful  as 
®  witnessed  by  their  low  utilisation  (see  for  example  the  meas¬ 

urements  of  Jacks',  i  &  Stubbs).1  Rather  than  provide  chan¬ 
nels  on  a  user-pair  basis,  ve  much  prefer  to  provide  a  single 
high-sp<>ed  channel  to  a  large  number  of  users  which  can  be 
shared  in  some  fashion;  this  then  allows  us  to  take  advantage 
of  the  powerful  “large  number  laws"  which  state  that  with 
g  very  high  probability,  the  demand  at  any  instant  will  be  ap¬ 

proximately  equal  to  the  sum  of  the  average  demands  of  that 
population.  In  this  way  the  required  channel  capacity  to  sup¬ 
port  the  user  traffic  may  be  considerably  less  than  la  the 
unaharid  ease  of  dedicated  channels.  This  approach  has  been 
used  to  great  effeet  for  many  years  now  in  a  number  of  differ¬ 
ent  contexts:  for  example,  the  use  of  graded  channels  in  the 
$  telephone  industry,5  the  introduction  of  asynchronous  time 

division  multiplexing,5  and  the  packet-switching  concepts 
introduced  by  Baran  et  al.,‘ Davies,5  and  finally  implemented 
in  the  ARP  A  network.*  The  esoenti&l  observation  i«  that  the 
full-time  allocation  of  a  fraction  of  the  chanr.el  to  each  user 
is  highly  inefficient  compared  to  the  part-time  use  of  the  full 
capacity  of  the  channel  (this  is  precisely  the  notion  of  time- 
sharing).  We  gain  this  efficient  sharing  when  the  traffic  con¬ 
sists  of  rapid,  but  short  burrti  of  data.  The  da^iral  schemer 
of  synchronous  time  division  multiplexing  and  frequency 
division  multiplexing  are  examples  of  the  inefficient  parti¬ 
tioning  of  channels. 

As  soon  as  we  introduce  the  notion  of  a  shared  channel  in  a 
^  packet-switching  mode  then  we  must  be  prepared  to  resolve 

9  etmfliets  whkh  arise  when  mew  than  on-  demand  is  straub 

tancously  placed  upon  the  channel.  There  are  two  obviouR 
solutions  to  this  problem:  the  first  is  to  “throw  out"  or  “lose” 
any  demands  which  are  made  while  the  channel  is  in  use; 
and  the  second  is  to  form  a  queue  of  conflicting  demands  and 
serve  them  in  some  order  as  the  channel  becomes  free.  The 


*  This  research  was  supported  by  the  Advanced  Keaearch  Projects 
Aftency  of  the  Department  of  Defense  under  Contract  No.  DAHC15-69 
C  0285 


latter  approach  is  that  taken  in  the  ARPA  network  since 
storage  may  be  provided  economically  at  the  point  of  con- 
fliet.  The  former  approach  is  taken  in  the  ALOHA  system5 
which  uses  packet -switching  with  radio  channels;  in  this  sys¬ 
tem,  in  fact,  all  simultaneous  demands  made  on  the  channel 
are  lost. 

Of  interest  to  this  paper  is  the  consideration  of  satellite 
channels  for  packet-switching.  The  definition  of  a  packet  is 
merely  a  package  of  data  which  has  been  prepared  by  a  user 
for  transmission  to  some  other  user  in  the  system.  The  satel¬ 
lite  is  characterized  as  a  high  capacity  channel  with  a  fixed 
propagation  delay  which  is  large  compared  to  the  packet 
transmission  time  (see  the  next  section).  The  (stationary) 
satellite  acts  as  a  pure  transponder  repeating  whatever  it  re¬ 
ceives  and  bet/ning  this  transmission  back  down  to  earth; 
this  broadcasted  transmission  can  be  heard  by  every  user  of 
the  system  and  in  particular  a  user  can  listen  to  his  own 
transmission  on  its  way  back  down.  Since  the  satellite  is 
merely  transponding,  then  whenever  a  portion  of  one  user’s 
transmission  reaches  the  satellite  while  another  user’s  trans¬ 
mission  is  being  t  rang  ponded,  the  two  collide  and  “destroy” 
each  other,  The  problem  we  are  then  faced  with  is  how  to 
control  the  allocation  of  time  at  the  satellite  in  a  fashion 
which  produces  an  acceptable  level  of  performance. 

The  ideal  situation  would  be  foe  ‘he  users  to  agree  collec¬ 
tively  when  each  could  transmit.  The  difficulty  is  that  the 
means  for  communication  available  to  these  f  -ographicaliy 
distributed  users  is  the  satellite  channel  itself  and  we  are. 
faced  with  attempting  to  control  a  channel  which  must  carry 
its  own  control  infumiAlk  n  There  bjt  event  tally  three  ap¬ 
proaches  to  the  solution  of  this  problem.  The  first  has  come 
to  be  known  as  a  pure  “ALOHA”  system5  in  which  users 
transmit  any  time  they  desire.  If,  after  one  propane* ion  de¬ 
lay,  they  hear  their  successful  transmission  then  they  assume 
that  no  conflict  occurred  at  the  satellite;  otherwise  they  know 
a  toHfeksn  oecwrvd  and  they  must  ret  ransmit.  H  utwrs  re¬ 
transmit  immediately  upon  hearing  a  conflict,  then  they  are 
likely  to  conflict  again,  and  so  some  scheme  must  be  devised 
for  introducing  a  random  retransmission  delay  to  spread 
these  conflicting  paekets  over  time. 

The.  second  method  for  using  the  satellite  channel  is  to 
“slot”  time  into  segments  whose  duration  is  exactly  eoual  to 
the  transmission  time  of  a  single  packet  (we  assume  con¬ 
stant  length  paekets).  If  we  now  require  all  packets  to  begin 
their  transmission  only  at  the  beginning  of  a  slot,  then  we 
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enjoy  a  gain  in  efficiency  Mince  collisions  tor  now  rent  rioted  to 
a  Mingle  mIoI  duration;  such  a  scheme  in  referred  to  an  a 
".slotted  ALOHA"  system  and  ia  the  principal  subject  of  thin 
paper.  We  consider  two  models:  the  first  m  that  of  a  large 
population  of  users,  each  of  which  makes  a  small  demand  on 
the  clianncl;  the  second  model  consists  of  this  background  of 
users  with  the  addition  of  one  large  user  acting  in  a  special 
way  to  provide  an  increased  utilisation  of  the  channel.  We 
concern  ourselves  w<th  retransmission  strategies,  delays,  and 
throughput.  Abramson'  also  considers  slotted  systems  and  i» 
concerned  mainly  with  the  ultimate  capacity  of  these  chan¬ 
nels  with  various  user  mixes.  Our  results  and  his  have  a  com¬ 
mon  meeting  point  at  some  limits  which  will  be  described 
below. 

The  third  method  for  using  these  channels  is  to  attempt  to 
.schedule  their  use  in  some  direct  fashion;  this  introduce  the 
notion  of  a  reservation  system  in  which  time  slots  are  re¬ 
served  for  specific  users’  transmissions  and  the  manner  in 
which  these  reservations  are  made  is  discuss'd  in  the  paper 
by  Robert  s.'  He  gives  an  analysis  for  the  delay  and  th.ough- 
put,  comparing  the  performance  of  slotted  and  reservation 
systems. 

Thus  we  are  faced  with  a  finite-rapacity  communication 
channel  subject  to  unpredictable  and  conflicting  demands. 
When  these  demands  collide,  we  "lose"  some  of  the  effective 
capacity  of  the  channel  and  in  this  paper  we  characterise  the 
effect  of  that  conflict.  Note  that  it  is  possible  in  use  the  chan¬ 
nel  up  to  its  full  rated  capacity  when  only  a  single  user  is 
demanding  service,  this  is  true  since  a  user  win  icver  con¬ 
flict  with  himself  (he  has  the  capability  to  schedule  his  own 
use).  This  effect  is  important  in  studying  the  non-uniform 
traffic  case  as  we  show  below. 

SLOTTED  ALOHA  CHANNEL  MODELS 

Model  /.  Traffic  from  many  email  users 

In  this  model  we  assume: 

(A1 )  an  infinite  numb. ::f  users*  who  collectively  form  an 
it.depei.ikrnt  smsm 

This  source  generates  M  packets  per  slot  from  the  distribu¬ 
tion  v,  =  ProbfAf  =  t]  with  a  mean  of  St  packets/ slot. 

We  assume  that  each  packet  is  of  constant  length  requiring 
T  seconds  for  transmission;  in  the  numerical  studies  pre¬ 
sented  below  we  assume  that  the  capacity  of  the  channel  is 
50  kilobits  per  second  and  that  the  packets  are  each  i  125  bits 
in  length  yielding  T ~  22.5  msec.  Note  that  -SY  =  .So/ 7  is  the 
average  number  of  packets  arriving  per  second  from  the 
source.  Let  d  be  the  maximum  roundtrip  propagation  delay 
which  we  assume  each  user  experiences  and  let  R=d/T  be 
the  number  of  slots  which  can  fit  into  one  roundtrip  propaga¬ 
tion  time;  for  our  numerical  results  we  assume  d  =  2? 0  msec, 
and  so  R  =  12  slots,  R  slots  after  a  transmission,  a  user  will 

*  TWe  will  b*  referred  to  «« the  "imtU"  user*. 


either  hear  that  it  was  suceetwful  or  know  tluit  i!  was  de¬ 
stroyed.  In  the  latter  caw  if  he  now  retransmits  during 
the  next  slot  interval  and  if  all  othe~  users  (shave  like¬ 
wise,  then  for  sure  they  will  collide  again;  eonseqtiently 
we  shall  assume  tluit  each  user  transmits  a  previously  col¬ 
lided  packet  at  random  during  one  of  the  next  K  slots, 
(each  such  slot  being  chosen  with  probability  1/A’).  Thus, 
retransmission  will  take  place  either  ft+1,  R+ 2,  ...  or 
R-^K  slots  after  the  initial  transmission.  As  a  result  traffic 
introduced  to  the  channel  from  our  collection  of  users  will 
now  consist  of  new  packets  and  previously  blocked  packets, 
the  total  number  adding  up  to  .V  packets  transmitted  per 
slot  where  pi  =  Proh YV  =  0  with  a  mean  traffic  of  (1  packets 
pee  slot.  Wc  assume  that  each  user  in  the  infinite  popu¬ 
lation  will  have  at  most  one  packet  requiring  transmission 
at  any  time  (including  any  previously  blocked  packets). 
Of  interest  to  as  is  s  description  of  the  maximum  through¬ 
put*  rate  S  as  a  function  of  the  channel  traffic  G.  It  is  clear 
that  S/G  is  merely  the  probability  of  a  successful  trans¬ 
mission  and  G/S  is  the  average  number  of  times  a  packet 
must  be  transmitted  until  success;  assuming 

(A2)  the  traffic  entering  the  channel  is  ac  independent 
process 

We  then  have, 

S=Gpt  (1) 

If  in  addition  we  assume, 

(AS)  the  channel  traffic  is  Poisson 
then  pt~  e-a,  and  so, 

.S’  =  Ge~°  (2) 

Eq.  (2)  was  first  obtained  by  Roberts"  who  extended  a  simi¬ 
tar  result  duo  to  Abramson7  in  studying  tlno  rudio  \UJH  A 
system.  It  represents  the  ultimate  throughput  in  a  Model  I 
slott'd  ALOHA  channel  without  regard  to  the  delay  packets 
experience;  we  deal  extensively  with  the  delay  in  the  next 
section. 

For  Model  1  we  adopt  assumption  Al.  We  shall  also  accept 
a  less  restrictive  form  of  assumption  A2  (namely  assumption 
A4  below)  which,  as  we  show,  lends  validitv  to  assumption 
A3  which  wc  also  require  in  this  model.  Assume, 

(A4)  the  channel  traffic  is  independent  over  any  A'  con¬ 
secutive  slots 

We  have  conducted  simulation  experiments  which  show  that 
this  is  an  excellent  assumption  so  long  as  A’  <  R. 

Let, 

«e 

Up, s'  (3) 

*-0 

Ew  (4) 

i-0 

*  Note  that  S— S\,  under  stahlo  system  operation  which  we  assume 
unless  stated  otherwise  (see  below}. 
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U(ting  only  amumptioi;  A4  ami  the  assumption  that  M  is  in¬ 
dependent  oIN-M,  we  find  f  10]  that  /’(z)  may  lx*  expnwBtxl 

as 

If,  further,  the  source  is  an  independent  process  (i.c.,  as¬ 
sumption  Al)  and  Is  Poisson  distributed  then  F(z) 
and  then  we  see  immediately  that, 

at* 

Thu  shows  that  assumption  A3  follows  from  assumptions 
Al  and  A4  in  the  limit  of  large  K,  under  the  reasonable  con¬ 
dition  that  the  source  is  Poisson  distributed. 

We  have,  so  far  defined  the  following  critical  system  param¬ 
eters:  St,  S,  G,  K  and  R.  In  the  ensuing  analysis  we  shall 
distinguish  packet1'  transmitting  in  a  given  slot  as  being 
either  newly  generated  or  ones  which  have  in  the  past  col¬ 
lided  with  other  packets.  This  leads  to  an  approximation 
since  we  do  not  distinguish  how  many  times  a  packet  has 
met  with  a  collision.  We  have  examined  the  validity  of  this 
approximation  by  simulation,  and  have  found  that  the  cor¬ 
relation  of  traffic  in  different  slots  is  negligible,  except  at 
shifts  of  ft+1,  R+ 2,  . .  . ,  It+K;  this  exactly  supports  our 
uppsnximation  since  we  concern  ourselves  with  the  most  re¬ 
cent  collision.  We  require  the  following  two  additional 
definitions: 

q  -  Prob[newly  generated  packet  is  successfully 
transmitted] 

v,  -  PnAj[pr(.vioUsly  blocked  packet  is  tmrasshtHy 
transmitted] 

We  also  introduce  the  expected  packet  delay  D- 

/)  =  average  time  (in  slots)  until  a  packet  is 
•Aicrcuefully  nrrivrd 

Our  principal  conr.ein  in  this  paper  is  to  investigate  the 
trade-off  between  the  average  delay  D  and  the  throughput  S. 

Model  II.  Background  traffic  with  one  large  user 

In  this  second  modei,  we  refer  to  the  source  described 
above  as  the  "background1'  source  but  we  also  assume  that 
there  is  an  additional  single  user  who  constitutes  a  <econd 
independent  source  and  we  refer  to  this  source  as  the  ‘large" 
user.  The  background  source  is  the  same  as  that  m  Model  I 
and  for  the  second  source,  we  assume  that  the.  packet  arrivals 
to  the  large  user  transmitter  are  Poisson  and  independent  of 
other  packets  over  R-\-K  consecutive  slots.  In  order  to  dis¬ 
tinguish  variables  for  these  two  sources,  we  let  .S’<  and  G<  refer 
to  the  S  and  ('<  parameters  for  the  background  source  and  let 
S;  and  (!t  refer  to  the  S  and  (S  parameter"  for  the  single  large 
user.  We  point  out  tliut  the  identity  of  this  large  user  may 


change  os  time  progresses  but.  insist  that  there  he  only  one 
such  at  tmy  given  time.  We  intnxluee  the  new  variables 

**»«,  +  «.  Ci) 

(i  •»  fi'i+fii  (l>) 

S  represents  the  total  throughput  of  the  system  and  G  repre¬ 
sents  the  traffic  which  the  channel  must  support  (including 
retransmissions).  We  have  assumed  that  the  small  users  may 
have  at  most  one  packet  outstanding  for  transmission  in  the 
channel;  however  the  single  large  user  may  have  many  pack¬ 
ets  awaiting  transmission.  We  assume  that  this  large  user  has 
storage  for  queueing  his  requests  and  of  course  it  is  his  re¬ 
sponsibility  to  see  that  he  does  not  attempt  the  simultaneous 
transmission  of  two  packets.  We  may  interpret  (7j  as  the 
probability  that  the  single  large  user  is  transmitting  a  packet 
in  a  channel  slot  and  so  we  require  Gi  <  1;  no  such  restriction 
is  placed  on  G,  (or  on  G  in  Model  I ) . 

We  now  introduce  a  means  by  which  the  large  user  can 
control  his  channel  usage  enabling  bim  to  absorb  some  of  the 
slack  channel  capacity;  this  permits  an  increase  in  the  total 
throughput  S.  The  set  of  packets  awaiting  transmission  by 
the  large  user  compete  among  each  other  for  the  attention  of 
his  local  transmitter  &a  iVJIijWk  Kadi  waiting  packet  will  he 
scheduled  for  transmission  in  some  future  slot.  When  a  newly 
generated  packet  arrives,  it  immediately  attempts  trans¬ 
mission  in  the  current  slot  and  will  succeed  in  capturing  the 
transmitter  unlea,  some  other  packet  has  also  been  scheduled 
for  this  slot;  in  the  case  of  such  a  scheduling  conflict,  the  new 
packet  is  randomly  rescheduled  in  one  of  the  next  L  slots, 
each  such  slot  being  chosen  equally  likely  with  probability 
1  ,/L.  Due  to  the  background  traffic,  a  large  user  packet  may 
nun  with  a  tronamu»'i<ni  conflict  at  the  satellite  (which  is 
discovered  R  slots  after  transmission)  in  which  case,  as  in 
Model  I,  it  incurs  a  random  delay  (uniformly  distributed 
over  K  slots)  plus  the  fixed  delay  of  R  slots.  More  than  one 
packet  may  be  scheduled  for  a  future  slot  and  we  assume 
that  these  scheduling  conflicts  are  resolved  by  admitting  that 
packet  with  the  longest  delay  since  its  previous  blocking  (due 
to  rorJUrt  in  tr&tuuausion  or  conflict  in  whnluUng  )  and  uni¬ 
formly  rescheduling  the  others  over  the  next  L  slots;  tics  ar' 
broken  hy  rand..m  selection.  We  sec:,  therefore,  that  new 
packets  have  the  lowest  priority  in  case  of  a  scheduling  con¬ 
flict;  however,  they  seize1  the  channel  if  it  is  free  upon  their 
arrival.  The  variable  L  permits  us  a  certain  control  of  chan¬ 
nel  i.re  hy  the  large  user  but  does  not  limit  his  throughput. 
We  also  assume  K,  L<R.  Corresponding  to  q  and  q,  in  Model 
I,  we  introduce  the  success  probabilities  q,  and  q,,  (i-  1,  2) 
for  new  and  previously  blocked  packets  respectively  and 
where  i  =  l  drtateit  the  background  source  it  d  »=2  denotes 
the  single  large  source.  Finally,  we  choose  to  distinguish  be¬ 
tween  Dt  and  D%  which  are  the.  average  number  of  slots  until 
a  packet  is  successfully  transmitted  from  the  background 
and  large  user  sources  respectively. 

RESULTS  OF  ANALYSIS 

In  *his  section  wc  present  the.  results  of  our  analysis  with¬ 
out  proof  The  details  of  proof  may  he  found  in  Reference  10. 
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Figure  1  —Throughput  u  c  (unction  of  channel  trsflir 


Model  /.  Traffic  from  many  small  users 

W<>  'Tish  to  refine  Kq.  (?)  by  accounting  for  the  effect  zt 
the  random  retransmission  oelay  jarameter  ft.  Our  principal 
rc  ult  in  this  case  is 


S-Cr 


SH-1-9 


where 


«-s 


(7) 


(8) 


and 


7,4ii5]Cr™“^<rM+  H  *  *  (&) 

The  considerations  which  led  to  Eq.  (7)  were  inspired  by 
Huberts11  in  which  he  developed  an  approximation  for  Eq. 
(9)  of  the  form 


gi¬ 


ft- 1 


(10) 


We  shall  see.  below  that  this  is  a  reasonably  good  » pp; <  .:ima- 
tion.  Equations  (7-9)  form  a  set  of  non-linear  simultaneous 
equations  for  S,  q  and  j,  which  must  be  solved  to  obtain  an 
explicit  expression  for  S  in  terms  of  the  system  parameters 
(7  and  ft.  In  general,  this  cannot  be  accomplished.  However, 
we  note  that  as  ft  approaches  infinity  these  three  equations 
reduce  simply  to 

Lim-  =  Lim  g  =  Lim  q,=e~°  (11) 

«t»  0  R«t>  | 

Thus,  we  see  that  Eq.  (2)  is  the  correct  expression  for  the 
throughput.  .S’  only  when  ft  approaches  infinity  which  cor¬ 
responds  to  the  case  of  infinite  average  delay;  Abramson* 
gives  this  result  and  numerous  others  all  of  which  corre¬ 
spond  to  this  limiting  case.  Note  that  the  large  ft  case  avoids 


the  large  delay  problem  if  ?  is  small  (very  high  epeed  chan¬ 
nels). 

The  numerical  solution  to  Eqs.  (7-9)  is  given  in  Figure  1 
where  we  plot  the  throughput  S  as  a  function  of  the  channel 
traffic  G  for  various  values  of  ft.  We  note  that  the  maximum 
throughput  nt  a  given  ft  occurs  when  G  =  1.  The  throughput 
improves  at  ft  increases,  finally  yielding  a  maximum  value 
of  N=  l/«  =  .368  for  G=  1,  ft = infinity.  Thus  we  have  the  un¬ 
fortunate  situation  that  the  ultimate  capacity  of  this  channel 
supporting  a  large  number  of  small  users  is  less  than  37  per¬ 
cent  of  its  theoretical  maximum  (of  1).  We  note  that  the 
efficiency  rapidly  approaches  this  limiting  value  (of  1/e)  as 
K  increases  and  that  for  ft  =15  we  are  almost  there.  The 
figure  also  shows  some  delay  contours  which  we  discuss 
below.  In  Figure  2,  we  show  the  variation  >f  q  and  q,  with  ft 
for  various  values  of  G.  We  note  how  rapidly  these  functions 
approach  their  limiting  values  as  given  in  Eq  (11).  Also  on 
this  curve,  we  have  shown  Roberts’  approximation  in  Eq. 
(10)  which  converges  to  the  exact  value  very  rapidly  as  ft 
increases  and  also  as  G  decreases. 

Our  next  significant  resub  is  for  packet  delay  as  given  by 

D=ft+l+I^[ft+l+^p]  (12) 

We  note  from  this  equation  that  for  larga  ft,  the  average 
delay  grows  linearly  with  ft  at  •  slope 

dD  1  -tr' 

X™  dft  “  2e  ~c 

Using  Eq.  (11),  w»  sec  that  this  slope  may  be  expressed  as 
G—S/2S  which  is  merely  the  ratio  of  that  portion  of  trans¬ 
mitted  traffic  which  meets  with  a  conflict  to  twice  the  through¬ 
put  ot  the  channel;  since  G—  8/28~)$(G/8—  1),  we  see 
that  the  limiting  slope  is  •■qual  to  times  the  average 
number  of  times  a  packet  is  retransmitted.  Little’s  well- 
known  result1*  expresses  the  average  number  (71)  of  units 
(packets  in  our  case)  in  a  queueing  system  as  the  product  of 
the  average  arrival  rate  (.8*=  8  in  our  case)  and  the.  average 
time  in  system  (£>).  If  we  use  this  along  with  Eqs.  (7)  and 
(12),  we  get 

rt=SZ)  =  (7^ft+l-f  — j— -j— s|— jp-J  (13) 


Figure  2— ^uccew  probabilities  as  a  function  of  ret  ra  nr  mission  delay 
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In  Figure  1  wc  plot  the  loci  of  constant  delay  in  the  S,  0 
plane.  Note  the  way  these  loci  bend  over  sharply  as  K  in¬ 
creases  defiring  a  maximum  throughput  Sm„(D)  for  any 
given  value  of  D\  we  note  the  coat  in  throughput  if  wo  wish 
to  limit  the  average  delay.  This  effect  is  clearly  seen  in 
Figure  3  which  is  the  fundamental  display  of  the  tradeoff 
between  delay  and  throughput  lor  Model  I;  this  figure  shows 
the  delay-throughput  contours  for  constant  values  of  K.  We 
also  give  the  minimum  envelope  of  those  contours  which  de¬ 
fines  the  optimum  performance  curve  for  this  system  (a 
similar  optimum  curve  is  also  shown  in  Figure  1).  Note  how 
sharply  the  delay  increases  near  the  maximum  throughput 
£••0.368;  it  is  clear  that  an  extreme  price  in  delay  must  be 
paid  if  one  wishes  to  push  the  channel  throughput  much 
above  0.3'  0  and  the  incremental  gain  in  throughput  here  is 
infinitesimal.  On  the  other  hand,  as  £  approaches  zero,  D 
approaches  fl+1.  Also  shown  here  are  the  constant  0  con¬ 
tours.  Thus  this  figure  and  Figure  1  arc  two  alternate  ways  of 
displaying  the  relationship  among  the  four  critical  system 
quantities  S,G,K,  and  D. 

From  Figure  3  we  observe  the  following  effect.  Consider 
any  given  value  of  S  ( say  at  S  =  0.20),  and  some  given  value 
of  K  (say  K-2).  We  note  that  there  are  two  possible  values 
of  I)  which  satisfy  these  conditions  (D-21.8,  D*=  161).  How 
do  we  explain  this?*  It  is  clear  that  the  lower  value  is  a  stabie 


THROUGHPUT  (PACKETS/SLOTI  S 

Figure  3  Delay-throughput  tradeoff 

*  This  question  «»s  raised  in  a  private  conversation  with  Martin  Gra¬ 
ham  (University  of  Califomi»,  Berkeley!  A  simulation  of  thia  situation 
is  reported  upon  in  Reference  13 


THROUGHPUT  I  PACKET  J/SC0TI  S 

Figure  4— Optimum  K 


operating  point  since  the  system  has  sufficient  capacity  to 
absorb  any  fluctuation  in  the  rate  S*.  Su  ypoee  that  we  now 
slowly  increase  &  (the  source  rate) ;  so  long  as  we  do  not 
exceed  the  maximum  value  of  the  system  throughput  rate 
for  this  K  (say,  Smtt(K)},  then  wt  see  that  S-  Sc  and  the 
system  will  follow  the  input.  Note  that  Sm««(K)  always  oc¬ 
curs  at  the  intersection  of  the  G«1  curve  as  noted  earlier. 
However,  if  we  attempt  to  set  S,>S*,i(K),  then  the  sys¬ 
tem  will  go  unstable!  In  fact,  the  throughput  S  will  drop 
from  SmAX(K)  toward  xero  as  the  system  accelerates  up  the 
constant  K  contour  toward  infinite  delay!  The  system  will 
remain  in  that  unfortunate  circumstance  so  long  as  S «>£ 
(where  now  S  is  approaching  gero).  All  during  its  demise,  the 
rate  at  which  new  packctB  are  being  trapped  by  the  system  is 
So— S.  To  recover  from  this  situation,  one  can  set  S»  =  0; 
then  the  delay  will  proceed  down  the  K  contour,  round  the 
bend  at  S„,,(K)  and  race  down  to  S»0.  All  this  while,  the 
backlogged  packets  are  being  flushed  out  of  the  system..  The 
warning  is  clear;  one  must  avoid  the  knee  of  the  K  contour. 
Fortunately,  the  optimum  performance  curve  does  avoid  the 
knee  everywhere  except  when  one  attempts  to  squeeze  out 
the  last  few  percent  of  throughput.  In  Figure  4,  we  show  the 
optimum  values  of  K  as  a  function  of  S.  Thus,  we  have  char¬ 
acterized  the  tradeoff  between  throughput  and  delay  for 
Model  I. 


Model  II.  Background  traffic  mih  one  lar  \e  user 

In  this  model  the  throughput  equation  is  similar  to  that 
given  in  Eq.  (7),  namely, 


"-'I  v*  .  ,  -  v  *  ■  / 

V.i-4- 1  — 

the  quantities  q,t  and  q,  are  given  in  the  appendix.  Similarly 
the  average  delays  for  the  two  classes  of  user  are  given  by 

Z>>  =  fl+l  +  l~Jl  jfl+l-b  ( 15) 

ft=fl+t+  ^  [*+i+  ~  L+  ■-=**,] 

qu  L  2  j  2  L  qu  J 
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where  E.  and  E,  arc  given  in  the  appendix.  It  is  easy  to  show 
t  hat  as  K,  L  approach  infinity, 


9l  =  9lr  =  <r°‘(l-0j) 

(17) 

.SWA*  0,(1-C.) 

(18) 

9s  =  9ti=*«“°‘ 

(19) 

St=(!tr°' 

(20) 

S  =  (G-GiGt)e-°  ‘ 

(21) 

■Gi +(/'  and  .S*=  Ni+Si. 

From  these  last 

equations  or  as  given  by  direct  arguments  in  an  unpublished 
note  by  IloltcrtH,  one  may  easily  show  that  at  a  constant 
background  user  throughput  Si,  the  large  user  throughput 
St  will  be  maximized  when 

f/«rt,r0,-l  (22) 

This  last  is  a  special  case  of  results  obtained  by  Abramson  in 
Reference  8  and  he  discusses  these  limiting  cases  at  length  for 
various  mixes  of  users.  We  ,iote  that, 

(23) 

dS 

—  =-e-°'(0-0,6',-l-'-0,)  (24) 

JO  I 


In  Figure  .r>  we  give  a  qualitative  diagram  of  the  .'{-dimen¬ 
sional  contour  fur  S  as  a  function  of  and  < It.  We  remind  the 
reader  that  this  function  is  shown  for  the  limiting  ease  K,  /, 
approaching  infinity  only.  From  our  results  we  mi1  that  for 
constart  (!i<  1,  S  increases  linearly  with  (It  (Gt<  1 ).  For 
cons  tan.  f»'i>  1,  S  decreases  linearly  as  (»,  inr-rcatn-H.  hi  ad¬ 
dition,  for  constant  (it<xA,  S  lias  a  maximum  value  nt 
6’i»  1  —  2f7j/l—  (/»,  Furthermore,  for  constant  S  de- 

creasis  as  Gt  increases  and  therefore  the  maximum  through¬ 
put  S  must  occur  at  »S*=6’j  in  the  0;  =  O  plane. 

The  optimum  curve  given  in  Eq.  (22)  is  shown  in  the  ,S'i,  St 
plane  in  Figure  6  along  with  the  performance  loci  at  consti  nt 
G i.  We  note  in  these  last  two  figures  that  a  channel  through¬ 
put  equal  tr.  1  is  achievable  whenever  the  background  traffic 
drops  to  zero  thereby  enabling  S“St*Gt=l;  this  corre¬ 
sponds  to  the  case  of  a  single  user  utilizing  the  satellite 
channel  at  its  maximum  throughput  of  1.  Abramson  [8]  dis- 


THSouoseur 

Figure  7-  l)el*y  throughput  tradeoff  at  S,  -0. 1 
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cusses  a  variety  of  curves  such  as  those  in  Figure  6;  he  con¬ 
siders  the  generalization  where  there  may  be  an  arbitrary 
number  of  background  and  large  users. 

In  the  next  three  figii.es,  we  give  numerical  results  for  the 
finite  K  case;  in  all  of  these  computations,  we  consider  only 
the  simplified  situation  in  which  K  =  I.  thereby  eliminating 
one  parameter.  In  Figure  7  we  show  the  tradeoff  between  de¬ 
lay  end  throughput  similar  to  Figure  3.  (Note  that  Figure  .1 
is  similar  to  Figure  1.)  Here  we  show  the  optimum  perform¬ 
ance  of  the  average  delay  />  =  SJ),  +  StDt/S  along  with  the 
behavior  of  D  at  constant  values  of  K  and  <Si  =  0.1  (note  the 
instability  once  again  for  overloaded  conditions).  Also  shown 
are  minimum  curves  for  l)\  and  !\,  which  are  obtained  by 
using  the  optimum  K  as  a  function  of  S.  If  we  are  willing  to 
rcdtiro  the  background  throughput  from  its  maximum  at 
,S|S=0.3()X,  we  can  drive  the  to'al  throughput  up  to  ap¬ 
proximately  .S=*0..Y2  by  introducing  additional  traffic  from 
the  large  user.  Note  that  the  minimum  Di  curve  is  much 
higher  than  the  minimum  l),  curve.  Thus  our  net  gain  in 
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channel  throughput  is  also  at  the  expense  of  longer  packet 
delay*  for  the  small  users.  Once  again,  we  see  the  sharp  rise 
near  saturation. 

In  Figure  8,  we  display  a  family  of  optimum  D  curves  for 
various  choices  of  Si  as  a  function  of  the  total  throughput  S. 
We  also  show  the  behavior  of  Model  I  as  given  in  Figure  G. 
Note  as  we  reduce  the  background  traffic,  the  system  capac¬ 
ity  increases  slowly;  however,  whim  Si  falls  below  0.1,  we 
Is-gin  to  piek  up  significant  gains  for  Si.  Also  observe  that 
each  of  the  constant  curves  "peels  off”  from  the  Model  I 
curve  at  a  value  of  S-  S,.  At  St  **  0,  we  have  only  the  large 
user  operating  with  no  collisions  and  at  this  point,  the  optimal 
value  of  L  is  1.  This  reduces  to  the  classical  queueing  system 
with  Poisson  input  and  constant  service  time  (denoted 
A//D/1)  and  represents  the  abtohde  optimum  performance 
contour  for  any  method  of  using  the  satellite  channel  when 
the  input  is  Poisson;  for  other  input  distributions  we  may- 
use  the  (1/D/X  queueing  results  to  calculate  this  ab>  jlute 
optimum  performance  contour. 

In  Figure  9,  wc  finally  show  the  throughput  tradeoffs  be- 
twoen  the  background  and  large  user*.  The  upper  curve  shows 
the  absolute  maximum  S  at  each  value  of  >S\;  this  is  a  clear 
display  of  '  ac  significant  gain  in  St  which  we  can  achieve  if 
we  are  willing  to  reduce  the  background  throughput.  Tho 
middle  eurve  (also  shown  in  Figure  6  and  in  Reference  8) 
shows  the  absolute  maximum  value  for  St  at  each  value  of 
Si.  The  lowest  curve  shows  the  net  gain  in  system  capacity  as 
»S'i  is  reduced  from  its  maximum  possible  value  of  1  /«. 

CONCLUSIONS 

In  this  paper  we  have  analyzed  the  performance  of  a  slotted 
satellite  system  for  packet-switching.  In  our  first  model,  we 
huve  disolayed  the  trade-off  between  average  delay  and 
average  throughput  and  have  shown  that  in  the  case  of 
traffic  consisting  of  a  large  number  of  small  users,  the  limiting 
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Figure  9-  Throughput  countoura 

throughput  of  the  channel  (1/e)  can  be  approached  fairly 
closely  without  an  excessive  delay.  This  performance  can  be 
achieved  at  relatively  small  values  of  K  which  is  tho  random 
retransmission  delay  parameter.  However,  if  one  attempts  to 
approach  this  limiting  eapecity,  not  only  does  one  encounter 
large  delays,  but  one  also  flirts  with  the  haxards  of  unstable 
behavior. 

In  the  case  of  a  single  large  user  mixed  with  the  background 
traffic,  we  have  shown  that  it  if  possible  to  increase  the 
throughput  rather  significantly.  The  qualitative  behavior  for 
this  multidimensional  trade-off  was  shown  and  the  numerical 
calculations  for  a  given  set  of  parameters  were  also  dis¬ 
played.  The  optimum  mix  of  channel  traffie  was  given  in 
Eq.  (22)  and  is  commented  on  at  length  in  Abramson’s 
paper.'  We  have  been  able  to  show  in  this  paper  the  relation¬ 
ship  between  delay  and  throughout  which  is  an  essential 
trade-off  in  these  slotted  pack  , -switching  systems. 

In  Roberts’  paper*  ho  discusses  an  effective  way  to  reserve 
slots  in  a  satellite  system  so  as  to  predict  and  prevent  con¬ 
flicts.  It  is  worthwhile  noting  that  another  scheme  is  cur¬ 
rently  tieing  investigated  for  packet-switching  systems  in 
which  the  propagation  delay  is  small  compared  to  the  slot 
time,  that  is,  /f  =  d/7’«l.  In  such  systems  i;  may  lie  ad¬ 
vantageous  for  a  user  to  "listen  before  transmitting”  in  order 
to  determine  if  the  channel  is  in  use  by  some  other  user; 
such  systems  are  referred  to  as  "carrier  sense”  systems  and 
seem  to  offer  some,  in  foresting  possibilities  regarding  their 
control.  For  satellite  communications  this  case  may  be 
found  when  the  capacity  of  the  channel  is  rather  small  (for 
example,  with  a  stationary  satellite,  the.  capacity  should  be- 
in  ike  range  of  1200  bps  for  the  packet  sizes  we  have  dis¬ 
cussed  in  this  paper i .  On  the  other  hand,  a.r>0  k’iobit  channel 
operating  in  a  ground  radio  environment  with  packets  on  the 
order  of  100  or  1000  bits  lend  themselves  nicely  to  carrier 
sense  techniques. 

In  all  of  these  schemes  one  must  trade  off  complexity  of 
implementation  with  suitable  performance  This  performance 
must  be  effective  at  all  ranges  of  traffic  intensity  in  that  no 
unnecessary  delays  or  loss  of  throughput  should  occur  due  to 
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complicated  opemliouiU  procedures.  We  feel  tlu.t  tlmslottixi 
satellite  packet-switching  methods  desi-rils-d  in  this  paper 
and  the  rmervnlion  systems  for  these  rhiumels  described  in 
the  paper  by  Roberts  do  in  fact  meet  them;  criteria. 
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APPENDIX 

Define  G»  £  Poisson  arrival  rate  of  packets  to  the  transmitter 
of  the  large  user 

-SxCl+E.-FErfH-E,)]  (A.l) 

The  variables  q„  g„  (t  =l,  2)  in  Eqs.  (14-16)  are  then 

given  m  follows  (see  Reference  10  for  details  of  the  deriva¬ 

tions)  : 

?.»(?.)*(9s)'7ra  (A.2) 

gn*  (g»)'c_‘gi.(g>)t«_a  (A.3) 


g^s-ewK-f  -[(l_e-o.)(e-o.-e-o.(K)-f(71e-fO;«>.)]  (A.4) 

A 


(#7.+n  er°' 


gi“<  j 

— -  (lA~a,lL—e  0')  L> 2 

L  —  1 


[e°‘/'(1-  lZp)  -c-^]  (A. 6) 

Let  us  introduce  the  following  notation  for  events  at  the 
large  user: 

SS = scheduling  success  (capture  of  the  transmitter) 

SC = scheduling  conflict  (failure  to  capture  transmitter) 
TS**  transmission  success  (capture  of  a  satellite  slot) 

TC=  transmission  conflict  (conflict  at  the  satellite) 

AT  =  newly  generated  packet 


r.-fr.E. 

l+£. 

r,+r,E, 

l+E, 


E,  £  average  number  of  SC  events  bi-fore  J  o 

an  SS  event  conditioning  on  AT  - - -  (A. 9) 

a. 

E,  £  average  number  of  SC  events  before  j  _  n 

an  SS  event  conditioning  on  TC  * - -  (A. 10) 

a. 

The  variables  a„  r,  (i*n,  t,  t)  are  defined  and  given  below: 


(a 

{ 

n.AProb  [SS/AT]*{?! 

\g 

H  * ) 

(A. 11) 

I 

r„  £  Prob  [TS/SS,  AT] 

=  qKr 

(A. 12) 

a,aProb[.SlS/rC]=  “ 

A 

1  —  (q»/q)K 
l-go/g 

(A.  13) 

r,  a  Prob  [TS/SS,  TC]  = 

■g*“W-ai 

(A. 14) 

a.  £  Prob  [SS/SC]-^ 

\*g«i-(v*)1 
)  l  i-q; 

(A.i:-) 

r.  £  Prob  [TS/SS,  ,S'C]  =  q*e~3t 

(A.  16) 

where 

n 

q  2=  g—0\lK  _|_  — I  g— (O  |+<?i) 

(A. 17) 
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ABSTRACT 

Two  problem  relevant  to  the  design  of  a  store-and- forward 
communication  network  (the  message  routing  problem  and  the  chan¬ 
nel  capacity  assignment  problem )  are  formulated  and  are  recog¬ 
nised  to  be  essentially  non-linear,  u ncon&tAcUnzd  multicommodity 
(m.c.)  flow  problems .  A  " Flew  Deviation "  (FD)  method  for  the 
solution  of  these  non-linear ,  unconstrained  m.c.  flow  problems 
is  described  which  is  quite  similar  to  the  gradient  method  for 
functions  of  continuous  variables ;  here  the  concept  of  gradient 
is  replaced  by  the  concept  of  ” shortest  route "  flow.  As  in  the 
gradient  method,  the  application  of  successive  flow  deviations 
leads  to  local  minima.  Finally,  two  interesting  applications 
of  the  FD  method  to  the  design  of  the  ARPA  Computer  Network  are 
discussed. 


1 .  INTRODUCTION 


In  this  paper  we  consider  a  procedure  (the  "flow  deviation" 
method)  for  assigning  flow  within  store-and-forward  communica¬ 
tion  networks  so  as  to  minimize  cost  and/or  delay  for  a  given 
topology  and  for  given  external  flow  requirements.  We  begin  by 
defining  the  basic  model  below  and  follow  that  with  some  examples. 
We  then  discuss  various  approaches  to  the  problem  and  then  in¬ 
troduce  and  describe  the  "flow  deviation"  method.  This  method 
is  evaluated  under  some  further  restrictions  and  is  then  applied 
to  various  problem  formulations  for  the  ART  A  network  [6],  (7). 

Suppose  we  have  a  collection  of  nodes  (i=l,...,n)  ,  and 

are  required  to  route  a  quantity  r„  of  type  (i,j)  commodity 
from  N^  to  Nj  through  a  given  network  (Fi' ure  1). 
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Fig.  1  Example  of  routing  of  the  (i,j)  commodity. 


The  multi commodity  (m.c.)  flow  problem  consists  of  finding 
the  routes  for  all  such  commodities,  which  minimize  (or  maxi¬ 
mize)  a  well-defined  performance  function  (e.g.,  cost  or  delay), 
such  that  a  set  of  constraints  (e.g. ,  channel  capacity  con¬ 
straints)  are  satisfied. 

The  most  general  multicommodity  problem  can  be  expressed 
formally  in  the  following  way: 

Given:  A  network  of  n  nodes  and  b  arcs 

An  n  *  n  matrix  R  =  (r. .] ,  called  the  require¬ 
ment  matrix,  whose  entries  are  non-negative 
Minimize:  (or  maximize)*  P(4>) 

over  4>  where  4>  is  the  flow  configuration  and  P  is  a 

well-defined  performance  function 

Furthermore,  4>  must  satisfy  the  following  constraints: 
Constraints: 

1.  must  be  a  multicommodity  flow  satisfying  requirement 
R.  For  rhis,  the  following  conditions  must  be  verified: 

Conservation  of  the  flew  at  nodes,  commodity  by  commodity: 


l  ^j) 


k=l 


kZ 


l  f  ; 

L  £m 


(ij>  . 


m=l 


-  r. .  if  Z  =  i 
ID 

+  r. ,  if  Z  =  j 

IT 

0  otherwise 


Vi,j 


(1.1) 


*  Without  l  os  8  of  generality ,  onty  the.  minimum  problem  is  con¬ 
sidered  in  the  following. 
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Non-negativity  of  flow  in  directed  arcs: 

fk^’  i  0  Vi.j.k.t  (1.2, 

where  f^jj^  is  the  portion  of  commodity  (i,j)  flowing  on  arc 
(k,t) . 

2.  ♦  must  satisfy  some  additional  constraints,*  different 

from  problem  to  problem  (e.g.,  capacity  constraints  on  each 
channel  and/or  cost  constraints) . 

Let  us  define  the  (i,j)  conmodity  flow  as: 


f(iii  -  (» ««> .  f<ij) . 


where  f  J  is  the  portion  of  (i,j)  commodity  flowing  in  arc  m 
m 

and  define  the  global  flow  f  as: 


n  n  .... 

{■ii  fu” 

i»l  j-1 

In  the  sequel,  we  restrict,  our  analysis  to  m.c.  problems  in 
which  the  performance  depends  solely  on  the  global  flow: 

P(*)  =  P(f)  (1.3) 

However,  most  of  the  arguments  and  techniques  presented  in  the 
paper  can  be  extended  to  the  general  case  of  P($>)  explicitly 
depending  upon  various  types  of  conmodities. 

So  far,  we  represented  the  flow  configuration  4>  in  terms 

of  f ,  vi,  j. 

An  equivalent  representation  is  obtained  by  providing  for 

k 

each  commodity  (i,j)  a  set  of  routes  k  =  1,  k„,  from 

k  k 

node  i  to  node  j,  associated  with  some  weights  a. .  (a. ,  >  0, 

J  13  i] 

K.  . 

13  k 

I  a. .  =  1) :  by  this  we  mean  that  commodity  (i,j)  is  trans¬ 
it  13  k 

ferred  from  i  to  i  along  K,  .  routes,  and  route  tt  .  .  carries  an 
k  x3  ^ 

amount  a..  •  r..  of  commodity  (i,j). 

13  13 


*If  an  m.o.  flow  problem  has  no  additional  constraints.  We  de¬ 
fine  it  to  be  an  uncoM&uuned  m.c.  flow  problem ;  such  a  defi¬ 
nition  will  be  motivated  in  one  of  the  following  sections. 


U 


u 


u 


100  FRATTA,  GERLA  AND  KLEIN ROCK 

As  a  third  representation,  we  can  consider  the  global  flow 
f.  It  can  very  tasily  be  seen  that  f  does  not  completely  char¬ 
acterize  <J> :  for  is, stance,  two  different  sets  of  routes  might 
yield  the  same  f.  However,  from  Equation  (1.3) ,  it  turns  out 
that  such  a  representation  is  sufficient  for  many  considerations, 
and  is  certainly  more  compact  than  the  previous  two.  In  the  fol¬ 
lowing  we  use  whichever  of  these  representations  is  most  con¬ 
venient. 

It  can  be  seen  that  the  set  of  m.c.  flows  satisfying  con¬ 
straints  (1.1)  and  (1.2)  is  convex.  In  particular,  if  we  let 

F  =  (f|f  is  an  m.c.  flow  satisfying  constraints  (1.1)  and  (1.2)}, 
we  have  that  F  is  a  convex  polyhedron.  The  global  flows  cor¬ 
responding  to  the  "corners"  (extreme  poi :,t.s)  of  F  have  an  inter¬ 
esting  property:  they  are  shortest  route*  flows  [9] . 

2.  MULTICOMMODITY  PROBLEMS  IN  THE  DESIGN  OF  S/F  NETWORKS 

Let  us  now  consider  a  store-and-forward  (S/F)  communication 
network  (1] .  In  such  a  network,  messages  traveling  from  N^  to 

N.  are  "stored"  in  queue  at  any  intermediate  node  N  ,  while 
J  k 

awaiting  transmission,  and  are  sent  "forward"  to  N^,  the  next 

node  in  the  route  from  N^  to  N^  when  channel  (k, £)  permits. 

Thus,  at  each  node  there  are  different  queues,  one  for  each  out¬ 
put  channel.  The  message  flow  requ  -nts  between  nodes  arise 
at  random  times  and  the  messages  are  ^f  random  lengths;  therefore 
the  flows  in  the  channels  and  the  queue  lengths  in  the  nodes  are 
random  variables.  Under  appropriate  assumptions, +  an  analysis 
of  the  system  can  be  carried  out  [1];  in  particular,  it  is  pos¬ 
sible  to  relate  the  average  delay  T  suffered  by  a  message  travel¬ 
ing  from  source  to  destination  (the  average  is  over  time  and  over 
all  pairs  of  nodes)  to  the  average  flows  in  the  channels. 

The  result  of  the  analysis  is: 


I  -T. 

L.  Y  1 


(2.1) 


i=l 


*A  shortest  route  flow  is  an  m.c.  flow  whose  routes  oan  he  de~ 
cribed  by  a  shortest  route  matrix,  computed  for  an  arbitrary 
assignment  of  lengths  to  the  arcs. 

+ Assumptions :  Poisson  arrivals  at  nodes,  exponential  distribu¬ 
tion  of  message  length,  independence  of  arrival  processes  at 
different  nodes,  independence  assumption  of  service  times  at 
successive  nodes  [2J. 
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where 


T  =  total  ave  .ye  delay  per  message  [sec/messg] 
b  *  number  ol  ,*es  in  the  network 

®  message  rare  on  channel  i  [messg/sec] 


n  n 

y "  l  l  r  y 

i=l  j=l  3 


total  message  arrival  rate  from 
external  sources  [messg/sec] 


T.  =  average  delay  suffered  by  a  message  waiting  for 
channel  i  [sec/messg] 

T^  is  the  sum  of  two  components: 

T  =  >p  ”  -f  rnlt 

i  i  i 


where 

T! 

i 


“ci  -  h 


=  transmission  and  queueing  delay 


TV  =  =»  propagation  delay 


and 

C_^  =  capacity  of  channel  i  [bits/sec] 

1/y  *  average  message  length  [bits/messg] 
We  can  rewrite  Equation  (2.1)  as  follows: 
b 

+  (X,/y)yp. 


-l 

V  *- 


CL  -  Xi/y 


U’.?) 


Letting  X^/y  =  f^,  Equation  (2.2)  becomes: 


T  -±  J 

V 


£. 

— ~~  +  f  .p! 
-  f.  ii 


(2.3) 


where 

f. 


average  bit  rate  on  channel  i  [bits/sec] 
VP: 


The  average  delay  T  is  the  most  common  performance  measure 
for  S/F  networks,  and  the  multicommodity  problem  consists  of 
finding  that  routing,  or  flow  pattern  F,  which  minimizes  T. 
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«e  may  now  pose  two  problems: 
Problem  A:  "Routing  assignment" 


Given: 


Minimize:  T(f 

over  f 

Constraints:  (i) 


Topology,  channel  capacities  and  a  require¬ 
ment  matrix  R 

T(f)  ■  i  I  (5TT-T  *  pp 

i«l  1  i 

(i)  £  is  an  m.c.  flow 


(n|  C^>  i  =  3,  •  •  •  i  o 

The  problem  is  in  the  standard  multi corauod  i  cy  form*  and  the 
additional  constraints  are  capacity  constraints.  Let  F^  be  the 

set  of  feasible  flows  for  Problem  A:  F  =  F  H  {fjf  <  c}. 

Clearly  F  is  a  convex  set  (intersection  of  convex  sets) . 

A 

A  second  interesting  problem  in  S/F  networks  is  formulated 
below  Assume  that  we  lave  a  given  network  topology  in  which 
the  channel  capacities  have  to  be  assigned.  A  cost  is  associated 
with  the  values  of  the  capacities,  and  the  total  cost  of  the  net¬ 
work  is  given.  In  addition,  the  flow  routes  must  be  determined. 
The  problem  statement  is: 

Problem  B':  "Routing  and  capacities  assignment,  general 
cost-capacity  function" 


Given: 


Minimize: 
over  Q,f 
Constraints: 


Topology,  requirement  matrix  R,  number  of 
dollars  available  D 

T(c.f)  -  i  f  <c-r-T*pp  fi 

1=1  1  1 

(i)  f  is  an  m.c,  flow 

(ii)  —  CiJ  *■  =  •••»*> 

b 

(iii)  7  d.(C.)  <  D 

i=l  1  1  ” 


where 


c  =  (c1,c2,...Jcb) 

d^(C^)  =  arbitrary  cost-capacity 
function  for  arc  i 

The  minimization  can  be  carried  out  first  on  C,  keeping  f  fixed, 
and  then  on  f. 

*The  possibility  of  formilating  the  routing  problem  as  a  multi¬ 
commodity  flow  problm  was  already  recognized  by  Frank  and  Chou 
in  [24].  An  interesting  linear  progreaming  approach  is  pre¬ 
sented  there. 


zz 
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If  the  cost-capacity  functions  are  linear  (i.e. , 
d^(C^)  =  d^C^) ,  then  the  minimization  over  C  can  easily  be  per¬ 
formed  by  the  method  of  Lagrange  multipliers  and  v,*  get  the 
following  optimum  capacities  as  functions  of  the  flows  [1]: 


D  /f .d. 
e  11 


where 


Ci"fi  +  d.  b 

j-i  3  3 


(2.4) 


D  =■  D  -  y  f.d. 

A  *-  11 

i=l 

By  introducing  Equation  (2.4)  into  the  expression  of  T(C,f)  we 
have: 

b 


Since 


and 


then 


and 


T(C,f)  =  T  (f ) 


(155)'  , 


I  Vi 

i=l 


+  7  I 

yd6  Ti-1 


for  (iii) 


b  b 

I  d.C.  >  y  d.  f.  for  (ii) 
l  l  —  i  l 


i=l 


i=l 


D  >  T  d.f. 
—  ii 

1=1 


(2.5) 


D  =■•  D  -  y  d ,  f .  >  0  (iv) 
e  l  l  — 

i=l 


It  i  easy  to  see  from  Equation  (2.4)  that  (iv)  implies  also 
(ii)  and  (iii) j  hence  both  (ii)  and  (iii)  can  be  replaced  by 
(iv). 

By  introducing  Equation  (2.5)  into  Probltsr.  B*  and  using 
result  (iv) ,  we  obtain t 
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Problem  3:  "Routing  and  capacities  assignment,  linear 
cost-capacity  function" 

Given:  Topology,  requirement  matrix  R,  number  of 

dollars  D 


Minimise: 
over  f 

Constraints: 


(i)  f  m.c.  flow 

(ii)  D  >  0 

e  — 


Again  the  problem  is  reduced  to  an  optimal  flow  problem  of  the 
standard  multicommodity  form.  The  additional  constraint  is  now 
a  cost  constraint.  Let  F^  be  the  set  of  feasible  flows  for 
Problem  B: 


b 

F  =  F  0  {f  |  D  -  l  d  £  >  0} 
i=l 

Clearly  Fb  is  convex. 

The  inspection  of  Problems  A  and  B  motivates  the  following 
important  observation: 

Observation: 

In  both  Problems  A  and  B,  the  performance  T(£)  goes  to  <» 
whenever  f  approaches  the  boundaries  defined  by  the  ad¬ 
ditional  constraints  (i.e.,  when  any  channel  becomes 
saturated  in  A,  or  when  the  excess  dollars  D  reduce  to 
zero  in  B) . 


Using  mathematical  programming  terminology,  the  performance 
T(f)  incorporates  the  additional  constraints  as  penali-y  functions. 
From  a  practical  point  of  view,  such  a  property  is  very  important: 
it  guarantees  the  feasibility  of  the  solution  (with  respect  to 
the  additional  constraints)  during  the  application  of  usulI  non¬ 
linear  minimization  techniques,  provided  a  feasible  storting  flow 
is  found. 

The  property  is  quite  general  for  S/I’  networks:  when  the 
additional  constraints  are  satisfied  with  equality,  usually  some 
saturation  occurs,  the  queues  at  nodes  grow  large  and  the  delay 
T  increases  rapidly. 

As  a  consequence  of  the  above  observation,  if  we  assume 
that  a  feasible  starting  solution  can  be  found,*  we  can  disregard 


*Teckniques  for  finding  feasible  starting  solutions  are  shorn  in 
the  applications  section. 
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the  additional  constraints  and  approach  Problems  A  and  B  as 
unconstrained,  a.c.  flow  problems.  Problems  A  and  B  will  be 
investigated  further  in  later  sections. 

3.  THE  PD  METHOD  AS  AN  APPROACH  TO  THE  SOLUTION  OF 
NON-LINEAR  M.C.  FLOW  PROBLEMS 

In  order  to  place  the  Flow  Deviation  (FD)  method  in  the 
proper  perspective  in  relation  to  the  existing  methods,  it  is 
convenient  to  classify  the  various  a.c.  flow  problems  into 
categories;  for  each  category,  the  solution  techniques  avail¬ 
able  in  the  literature  are  reviewed  and  the  contribution  of 
the  FD  method  is  discussed. 

a)  Unconstrained  M.C.  Flow  Problems 

a.l)  Linear  performance.  The  linear  min  cost  flow  problem  with 
no  constraints  on  capacity  has  the  well  known  shortest  route 
solution  (where  the  arc  length  is  equivalent  to  the  linear  cost 
of  the  arc)  (9,12).  Very  efficient  techniques  are  available  for 
the  evaluation  of  all  shortest  routes  on  a  graph  and  for  the 
routing  of  the  commodities  along  such  routes  [9,16];  therefore 
it  appears  convenient  to  reduce  complicated  flow  problems  (i.e., 
non-linear,  or  constrained)  to  the  linear,  unconstrained  form, 
which  can  be  solved  efficiently. 

a.  2)  Lon-linear  performance.  The  most  natural  thing  to  do  is 
to  linearize  the  problem.  Problems  which  are  separable*  and 
convex  can  be  linearized  by  approximating  the  convex  functions 
with  piecewise  linear  functions  and  by  introducing  one  supple¬ 
mentary  variable  and  one  constraint  equation  for  each  linearized 
segment  [11,15,24].  This  method  has  two  serious  drawbacks: 
first,  it  can  be  applied  only  to  separable  and  convex  problems; 
secondly,  the  number  of  variables  and  constraints  becomes  pro¬ 
hibitively  large  for  large  networks. 

Another  method,  which  applies  to  differentiable  problem, 
consists  of  approximating  the  performance  function  with  :he 
tangent  hyperplane,  which  is  expressed  in  terms  of  the  partial 
derivatives  {3P/3f^}.  The  min  cost  solution  of  the  linearized 

problem  is  the  shortest  route  flow,  where  the  length  of  arc  i 
is  defined  as  3P/3th  .  As  it  will  be  shown  later,  such  shortest 

route  flow  represents  the  direction  of  the  steepest  descent  flow 
deviation. 


flow  problem  has  the  form: 
b 


*A  separable  m.c. 
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The  above  idea  is  the  essence  of  the  FD  method,  which  con¬ 
sists  of  repeated  evaluations  of  steepest  descent  directions  and 
of  one  variable  minimizations  along  such  directions;  the  method 
{described  in  Section  5)  is  conceptually  very  similar  to  the 
gradient  method  applied  to  non-linear  minimization  problems.  If 
the  problem  is  differentiable,  the  FD  method  is  clearly  superior 
to  the  supplementary  variables  method  mentioned  before:  it  does 
not  add  new  variables  and  constraints,  and  can  be  applied  to  non- 
convex,  non-separable  cases. 

In  fact,  the  idea  of  using  shortest  routes  (computed  with 
partial  derivatives)  for  the  solution  of  non-linear  problems  is 
not  new:  using  such  techniques,  Dafermos  [17]  solved  various 
traffic  problems,  formulated  as  unconstrained,  convex  m.c.  flow 
problems,  and  Yaged  [18]  solved  a  min  cost  capacity  assignment 
for  a  communications  network,  which  was  formulated  as  an  uncon¬ 
strained,  concave  m.c.  flow  problem. 

Dafermos  stated  the  conditions  for  the  optimality  of  the 
solution  and  proposed  am  algorithm  for  finding  the  optimal  rout¬ 
ing  in  the  convex  case;  the  algorithm,  however,  is  impractical 
for  large  nets,  as  it  requires  the  bookkeeping  of  all  paths  for 
all  comnodities  [17].  Yaged' s  results,  on  the  other  hand,  are 
very  restricted:  they  apply  only  to  a  separable,  concave  prob¬ 
lem  [18] . 

In  this  paper,  we  attempt  a  more  general,  systematic  in¬ 
vestigation  of  the  method;  we  introduce  the  main  results  in  a 
more  straightforward  way  and  in  a  simpler  formulation  than  in 
[17] .  We  indicate  an  algorithm  which  is  applicable  to  non- 
separable  problems  and  which  has  been  efficiently  applied  to 
large  nets. 

b)  Constrained  M.C.  Flow  Problems 

b.l)  Linear  performance ,  linear  constraints.  The  classical, 
and  most  efficient,  approach  is  the  Dantzig-Wolfe  decomposition 
[13,14],  which  reduces  the  solution  of  the  main  problem  to  the 
repeated  solution  of  a  Master  Problem  and  a  Subproblem.  The 
Master  is  a  linear  program  containing  the  additional  constraints, 
and  the  Subproblem,  which  generates  new  columns  to  introduce 
into  the  Master,  is  an  unconstrained  linear  min  cost  flow  prob¬ 
lem. 

b.Z)  Non-lincar  performance,  non-linear  constraints.  The 
general  theory  of  non-linear  problems  with  non-linear  constraints 
is  very  hard.  The  special  case  of  convex  performance  and  concave 
non-negativity  constraints,  however,  can  be  attacked  efficiently 
with  the  Dantzig-Wolfe  decomposition  for  convex  programs  [11]; 
the  Master  Problem  is  a  linear  program,  and  the  column  generating 
Subproblem  is  an  unconstrained  convex  min  cost  flow  problem. 

Here  is  another  important  area  of  application  for  the  FD  method. 


o 
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We  shoved  that  the  two  design  problems  considered  in  the 
paper  can  be  regarded  as  unconstrained  m.c.  flow  problems ,- 
therefore,  in  the  sequel,  unless  otherwise  specified,  we  refer 
to  unconstrained  problems. 

4.  STATIONARITY  CONDITIONS 


Let  us  assume  that  P(f)  is  continuous  with  its  first  par¬ 
tial  derivatives.  We  want  to  establish  necessary  and  sufficient 
conditions  for  f  to  be  stationary. * 

The  most  general  perturbation  (which  we  define  as  flow 
deviation)  iround  f  can  be  obtained  as  a  convex  combination  of 
f  with  any  m.c.  flow  v.  The  result  of  such  flow  deviation,  f1, 
is  expressed  as: 


f'  =  (1  -  A)f  +  Av  =  f  +  A(v  -  f) 


where 


veF,  0  <_  A  <_  1 

If  A  -*■  0,  the  flow  deviation  is  infinitesimal .  For  X  =  6 A  «  1, 
we  have: 

.  b 

6P(f)  P(f’)  -  P(f)  -  6A  l  £k(vk  -  fk)  (4.1) 

k=l 

where 

i  -IE 

k  3f. 
k 

From  Equation  (4.1)  and  f.-om  the  definition  of  stationarity,  f 
is  stationary  if: 


I  £k{vk  '  V  -0,  - 


(4.2) 


k--l 


We  can  also  produce  infinitesimal  perturbations  that  involve 
only  one  of  the  commodities;  f  must  be  stationary  with  respect 
to  any  one  of  them  separately.  It  follows  that  f  is  stationary 
if,  for  all  (i,j)  commodities: 


I  Uv«ij)-f'ij))>0,vy£F 


(ij) 


(4.3) 


k=l 


*£  is  defined  as  stationary  if,  for  any  infinitesimal  perturba¬ 
tion  &£  (such  that  f  +  Sf  is  also  m.o.  flow)  we  have 

P(f  +  6f)  >  P(f) 

A  local  minimum  is  always  stationary ;  the  opposite ,  however , 
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(il) 

where  F  J  is  the  set  of  the  feasible  (i,j)  commodity  flows. 
In  fact.  Equations  (4.2)  and  (4.3)  a/e  equivalent,  as  will  be 
seen  from  the  subsequent  derivations.  Condition  (4.2)  can  be 
remitten  as: 


b  b 

min  V  £  v  >  7  i  f 

„  r  ,  ,  k  k  -  L  k  k 
ver  k=>l  k=l 

But,  as  f  e  F,  Equation  (4.4)  becomes: 


min  I  lvv. 

veF  k-»l 


b 

l 

k=l 


Similarly,  Equation  (4.3)  becomes: 

b 


*.  I  v(iJ’ 


k-1 


“kVk 


b 

l 

k-i  k  k 


(ij) 


(4.4) 


(4.5) 


(4.6) 


Condition  (4.5)*  is  easy  to  check:  the  right  hand  side  can  be 
directly  evaluated,  and  the  left  hand  side  requires  the  compu¬ 
tation  of  the  shortest  route  flow  under  the  metric  {£,  }. 

k 

If  we  represent  the  m.c.  flow  as  a  collection  of  weighted 
routes  (see  Section  1) ,  Equation  (4.6)  becomes: 


min  J 
TT '  k^TT* 


r .  , 
k 


NP 

l  l 

m=l  keir 


£.  (a  r.  .) 
k  m  13 


m 


(4.7) 


where 

ir* 

is 

any 

m 

m  » 

1. 

m 

m  = 

1. 

NP 

is 

the 

Let 

£  (if)  - 

(i, j)  route 

...»  NP,  are  the  (i,j)  routes  used  by  commodity  (i,j) 

...,  NP,  are  the  associated  weights 

total  number  of  routes  used  by  commodity  (i,j) 

\  £  ;  Equation  (4.7)  becomes: 

keir 


min  £ (tt * ) 

7T  * 


m=l 


a  £(tt  ) 
m  m 


(4.8) 


*A  different  derivation  of  Equation  (4.5)  is  given  in  [19]. 
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Recalling  that  a  >  0,  ym,  and  J  a  =  1,  we  obtain,  for  all 


comnodities 


l(nx)  *  £(it2)  =  ...  <  i(n') 


(4.9) 


where  it'  is  any  (i,j)  route. 

Condition  (4.9)  is  stated  also  in  [17];  a  similar  equi¬ 
librium  condition  was  mentioned  by  Wardrop  [20].  In  fact,  the 
condition  is  very  intuitive:  it  states  that  all  non-zero  weight 
routes  must  have  the  same  marginal  "gain,"  whereas  the  zero- 
weight  routes  must  be  less  (or,  at  most,  equally)  convenient 
than  the  weighted  ones.  For  an  immediate  interpretation  of 
Equation  (4.9),  suppose  there  are  two  paths,  and  ir^,  both 

with  non-zero  weight,  which  do  not  satisfy  Equation  (4.9),  i.e., 
£(ifp)  >  i(ir  ),  say.  An  infinitesimal  deviation  of  commodity 

(i,j)  from  ir  to  it  produces  a  variation  6P  <  0;  therefore,  the 

initial  flow  configuration  was  not  stationary. 

Notice  that  test  (4.5)  is  computationally  more  convenient 
than  test  (4.9),  as  (4.5)  only  requires  the  knowledge  of  the 
global  flow,  while  (4.9)  requires  the  knowledge  of  all  the  paths 
[19]  . 

The  question  remains,  whether  the  stationary  point  is  a 
local  (or  global)  minimum.  If  P(f)  is  strictly  convex,  the 
stationary  point,  if  it  exists,  is  unique  and  is  a  global  min. 

If  P(f)  is  not  convex,  further  considerations  are  required. 

5.  DESCRIPTION  OF  THE  FD  METHOD 


The  results  of  the  previous  section  indicate  that,  if  f  is 
not  a  stationary  flow,  then  the  shortest  route  flow  (evaluated 
under  the  metric  =  SP/Sf^)  represents  the  flow  deviation  of 

steepest  decrease  for  P.  This  fact  suggests  a  method,  which  we 
call  Fiji)  Deviation  method,  for  the  determination  of  stationary 
solutions  of  unconstrained,  non-linear,  differentiable  flow  prob¬ 
lems  P(f)  . 

The  FD  can  be  regarded  as  an  operator  (denoted  by  FD(v,A)G) 
which  maps  an  m.c.  flow  f  into  another  m.c.  flow  f '  and  is  de¬ 
fined  as  follows: 


where 


FD(v,A)  0f  “  (1  -  A)f  +  Av  =  f' 

•r  ~  ^  » 

v  is  a  properly  chosen  m.c„  flow  e  F 
A  is  the  step  size  (0  <  A  <  1) 
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Clearly  FD  is  a  map  of  F  onto  itself: 

FD(v,A)  :  F  ■*  F 

Now,  for  each  £  e  F,  we  want  to  determine  a  pair  (v,A)  in  such 
a  way  that  the  repeated  application  of  FD(v,A)  (starting  from 
0  n 

any  flow  f  ),  produces  a  sequence  (f  }  which  converges  to  a 
stationary  flow.  If  we  can  define  such  a  FD(v,A) ,  then  we  have 
an  algorithm  for  the  determinat ion  of  stationary  flows. 

It  can  be  shown  [21]  that,  for  a  function  P(f)  which  is 
continous,  nondegenerate*  and  lower  bounded,  the  following  con¬ 
ditions^  are  sufficient  for  the  convergence  of  an  FD-mapping 
to  a  stationary  flow: 

(i)  AP ( f )  >_  0  V  f  e  F 

(ii)  AP(f)  =  0  =>  f  stationary 

where  AP(f)  =  P(f)  -  P(FD©  f) 

Conditions  (i)  and  (ii)  require  that  the  FD  method  be  a  true 
steepest  descent  method. 

Again  in  [21]  it  was  shown  that  under  reasonable  assump¬ 
tions®  on  P(f},  the  following  definition  of  FD(v,A)  satisfies 
conditions  (i)  and  (ii) : 

v  =  shortest  route  flow  under  metric  Z,  11 

~A  k 

A  =  minimizer  of  P[(l  -  A)f  +  Ay],  0  <_  A  <  1  (5.2) 

*P(f)  is  defined  to  be  nondegenerate  if3  for  any  two  distinct 
stationary  flows ,  say  f*  and  f^,  we  have: 

Ptf1)  /  P(f). 

+ Similar ,  but  more  restrictive  conditions  were  stated  by  Dafermos 
in  117]. 

§The  assumptions  are:  P(f)  continuous  and  lower  bounded ;  first 
partial  derivatives  continuous  and  nonnegative ;  second  partial 
derivatives  <  +  P(£)  nondegenerate.  The  nonnegativity  of 

the  first  partial  derivatives  i3  a  reasonable  assumption ,  as, 
in  general ,  the  performance  that  we  want  to  minimize  is  an  in¬ 
creasing  function  of  the  flow  in  each  arc. 

11 Notice  that,  by  assumption ,  Z ^  -  dP/'df^  >_  0;  this  fact  excludes 

the  presence  of  negative  cycles,  which  would  nave  caused  the 
failure  of  the  shortest  route  computation  (and  therefore  of  the 
FD  algorithm) . 
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Another  valid  definition  of  FD  is  the  following. 


Let: 


=  shortest  (i,j)  path  (under  metric  l ) 
^  longest  (i,j)  path,  with  ct?j  >  0 


Define  (i,j)  -  deviation  as  the  deviation  of  commodity  (i,j) 

from  to  which  minimizes  P(f).  Define  the  FD  operator 

as  the  composition  of  all  (i,j)  deviations:  such  a  definition 
satisfies  (i)  and  (ii).* 

A  general  algorithm,  based  on  the  first  definition  of  the 
FD  operator,  is  outlined  as  follows: 

1.  Find  a  feasible  starting  flow  f^ 

2.  Let  n  =  0 

-»  *n+ 1  .  n  .  n .  -  n 

3.  f  =  FD(v  ,a  )  @  f 

4.  If  (P(fn)  -  P(fn+1)}  <  e,  /or  if  l 

\  k=l 

where  e  and  e  are  acceptable  positive  tolerances,  stop. 

Otherwise,  let  n  =  n  +  1  and  go  to  3. 

The  algorithm  converges  to  stationary  points;  however,  the 
only  stationary  points  of  stable  equilibrium  are  the  local  minima, 
so  we  can  assume  that  the  algorithm  converges  to  local  minima. 

In  the  case  of  P(f)  strictly  convex,  the  algorithm  converges 
to  the  global  min  (see  Appendix  I  for  a  proof  of  convergence  and 
an  upper  bound  on  the  error) . 

For  P(£)  non-convex,  one  should  explore  all  local  minima, 
in  order  to  find  the  global  minimum.  However,  a  systematic 
search  is  impossible,  for  large-size  networks,  so  heuristic  ap¬ 
proaches  (like  the  repeated  application  of  the  FD  algorithm  to 
various  initial  flow  configurations)  have  to  be  devised.  In  the 
case  of  P(£)  concave  (or  quasi-concave  [23]),  the  local  minima 
correspond  to  extreme  points  of  F,  i.e. ,  to  shortest  route  flows 
[23] :  this  property,  as  shown  later,  greatly  simplifies  the  FD 
algorithm  and  speeds  up  its  convergence. 

In  the  following  sections,  the  FD  method  is  applied  to  the 
solution  of  Problems  A  and  B. 


‘X- 


n. 

V 


<  G 


*Suoh  an  FD  operator  is  essentially  the  "equilibration  operator" 
defined  by  Dafermos  [273. 

t Such  a  teit  is  obtained  directly  from  the  stationarity  condi¬ 
tion  (3.5). 
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6.  THE  ROUTING  ASSIGNMENT 


Let  us  consider  Problem  A  in  Section  2.  The  performance 

T(f)  (see  Equation  (2.3))  is  strictly  convex  (separable  sum  of 

strictly  convex  fun  .ions) ,  and  the  feasible  set  F  is  a  convex 

A 

polyhedron.  Therefore,  if  the  problem  is  feasible,  there  is  a 
unique  stationary  point,  which  is  the  global  minimum.  The  ad¬ 
ditional  constraints  are  included  in  T(f)  as  penalties;  there¬ 
fore,  if  we  can  find  a  feasible  starting  flow  f^  e  F  ,  Problem 

A 

A  can  be  regarded  as  am  unconstrained  m.c.  flow  problem  and 
solved  with  the  FD  method. 

Let  us  check  if  T(f)  satisfies  the  conditions  fcr  the  con¬ 
vergence  (see  Section  5) .  The  first  and  second  partial  deriva¬ 
tives  are: 


32T 


»fi»fj 


0  for  i  f  j 


2C. 

l 


y  <c. 


for  i  =  j 


(6.1) 


(6.2) 


From  Equation  (2.3),  the  optimal  solution  f*,  if  it  exists  (i.e., 
if  the  problem  is  feasible) ,  satisfies  the  capacity  constraints 
as  strict  inequalities  (f*  <  C.  Vi).  Therefore,  we  can  find  an 
e  >  0  s.t. :  1 

f*  e  F'  =  F‘  n  (f|f.  <  C.  -  z)  (6.3) 

A  - 1  l  —  l 

The  application  of  the  FD  method  can  be  restricted  to  F'  C  F  ; 

A  A 

for  f  e  F  j  the  sufficient  conditions  on  the  first  two  deriva- 

"*  A 

tives  of  P(f)  (as  from  Section  5)  are  satisfied;  therefore  the 
FD  algorithm  converges  to  the  global  minimum. 

In  order  to  find  a  flow  f°  e  F_.  several  methods  are  avail- 

-  A 

able.  One  of  them  was  described  in  (19).  Another  method  (ap¬ 
plied  below)  consists  of  picking  any  f  e  F,  and  then  reducing 
the  flows  in  all  arcs  by  a  scaling  factor  RE,  until  a  feasible 

flow  f°  =  RE*  f  t  F.  is  obtained;  f°  satisfies  a  reduced  require- 
A  -  Q 

ment  matrix  RQ  =  RE*R.  The  FD  method  is  applied  using  f  as 
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starting  flow  and  RQ  as  starting  requirement;  after  each  FD 

iteration,  the  value  of  RE  is  increased  up  to  a  level  very  close 
to  saturation.  The  search  for  a  feasible  flow  terminates  when 
one  of  the  two  following  cases  occurs:  either  RE  >  1,  and  a 
feasible  flow  is  found;  or  the  network  is  saturated,  T(f)  is 
minimized  and  RE  <  1.  In  the  latter  case  the  problem  is  in¬ 
feasible  and  we  are  finished. 

The  FD  algorithm  for  the  solution  of  the  routing  problem 
consists  of  two  phases,  Phase  1  and  Phase  2.  In  Phase  1  a 

feasible  flow  f°  is  found  (if  it  exists) ,  or  the  problem  is  de¬ 
clared  infeasible.  In  Phase  2  the  optimal  routing  is  obtained. 
The  algorithm  is  outlined  as  follows: 

Phase  1: 

0.  With  REg  =  1,  let  f°  be  the  shortest  route  flow  computed  at 
f  *  0,  i.e.  with  metric  =  OT/Sf^]  =  l/y(l/C.  +  p^).* 
Let  n  *  0.  ^k  ® 

1.  Let  a  -  max 

n  k 

If  a  /RE  <1,  let  f°  =  fn/KE  and  go  to  Phase  2.  Otherwise, 
n  n  -  ~  n 

let  RE  ,,  =  RE  (1  -  e  (1  -  a  ))/a  .  where  e  is  a  proper 
n+i  n  n  n 

tolerance,  0  <  t  <  1. 

Let  gn+1  =  fn(RE  .,/RE  ).+  Go  to  2. 

~  -  n+1  n 

2.  Let  f"+1  =  FD  ©  gn+1 

where  FD  is  defined  as  in  Equation  (5.2). 

3.  If  n  =  0,  go  to  5. 

*The  shortest  route  n . .  is  therefore  the  route  for  which 

I'd 

(p^  +  1/C^)  is  minimum.  Notice  that  1/C 7^  is  the  trccns- 

.  •  f  , 

mission  delay  per  hit  on  channel  k  and  p ^  vs  the  propagation 

delay.  No  queueing  delay  is  considered  as  the  traffic  is  zero 
(f^  =  0).  So,  as  we  expect,  for  f^  0,  the  shortest  route 

it  . .  minimizes  the  sum  of  transmission  +  propagation  delay, 
vj 

* g  is  a  feasible  m.c.  flow  with  requirement  SEn+ 


l 

ken . . 
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4‘  11  II,  *k<\  -  \L)\  <  6  and  Ire^  -  RE  I  <  J, 

k=l 

where  0  and  6  are  proper  positive  tolerances,  and  v  is  the 

shortest  rou.;e  flow  computed  at  gn+\  stop:  the  problem  is 
infeasible  within  tolerances  6  and  5.  Otherwise,  go  to  5. 

5.  Let  n  =  n  +  1  and  go  to  1. 

Phase  2: 

0.  Let  n  *=  0. 

i  «rn+l  ^ 

1.  f  =  FD  <3>  f 

2*  If  II  *k<vk  "  ^1  <  0'  where  0  is  a  proper  positive  tol¬ 
erance,  stop:  f11  is  optimal  within  a  tolerance  0. 

Otherwise,  let  n  =  n  +  1  and  go  to  1. 

The  algorithm,  in  the  form  described  above,  provides  only 
the  optimum  global  flow  f.  If  complete  information  about  the 
routes  take.<  by  each  commodity  is  required,  a  simple  updating 
of  routing  tables  at  each  FD  iteration  allows  one  to  recover  it 
at  the  end  of  the  algorithm  (see  [19]). 

7.  NON-BIFUR IATED  ROUTING  FOR  LARGE  AND  BALANCED  NETS 

An  m-c.  flow  is  defined  to  be  non-bi furcated  if  each  com¬ 
modity  flora  along  one  route  only.  Some  applications  require 
a  non-bifurcated  routing  assignment;  in  some  other  applications 
the  non-bifurcated  solution  is  a  very  good  approximation  to  the 
optimum  bifurcated  one,  and  is  obtained  with  considerable  saving 
in  the  amount  of  confutation  (see  below) .  The  above  reasons 
motivate  an  investigation  of  the  non-H  furcated  routing  assign¬ 
ment. 

The  introduction  of  the  "non-bifurcation"  constraint  re¬ 
duces  the  set  of  feasible  m.c.  flows  to  a  discrete  set:  the 
number  of  elements  in  the  set  is  equal  to  the  number  of  all 
possible  combinations  of  ir^  paths,  V  i,j.  Continuous  tech¬ 
niques,  like  the  FD  method,  cannot  in  general  be  used;  discrete 
techniques,  on  the  other  hand,  are  very  involved  and  computa¬ 
tionally  prohibitive  already  for  networks  of  medium  size  (on. 
the  order  of  ten  nodes).  It  is  of  interest  to  devise,  therefore, 
efficient  sub-optimum  techniques.  We  will  show  that,  in  the 
important  case  of  "large  and  balanced  networks,"  a  modification 
of  the  FD  method  can  be  successfully  applied. 


34 


THE  FLOW  DEVIATION  METHOD  115 


A  network  is  said  to  be  large  if  it  has  a  large  number  of 
nodes;  it  is  said  to  be  balanced  if  the  elements  r^_.  of  the  re¬ 
quirement  matrix  R  are  not  highly  diversified  one  from  the  other. 
For  a  more  precise  definition  of  "balanced,"  let  r: 

a  i  y 

r  (n  -  l)n  . .  rij 
ID 

be  the  average  requirement  per  pair  of  nodes  and  let  m: 

m  =  max  [r.  ./r] 

(ij)  13 

be  the  ratio  between  the  max  and  the  average  requirement.*  No¬ 
tice  that  m  >_  1  and  that  m  *»  1  corresponds  to  a  uniform  require¬ 
ment  matrix.  A  network  is  said  to  be  balanced  if  m  is  close  to 


We  now  combine  these  ideas  into  the  notion  of  "large  and 
balanced  net."  Let: 

A  Km  „ 

n  =  - — r  (7.1) 

(n  -  l)p 

where:  K  =  b/n,  the  average  arc  to  node  density  of  the  graph. 

p'  =  ^  I  rij>  where  Pij  is  the  length  of  the 

shortest  (i,j)  path  (length  of  a  path  =  number  of 
arcs  in  the  path) ;  p1  is  therefore  the  average 
path  length,  when  all  commodities  are  routed  along 
the  shortest  paths. 

A  network  is  defined  large  and  balanced  if  n  «  1.  In  order  to 
motivate  such  a  definition,  let  us  consider,  for  an  arbitrary 
m.c.  flow  f,  the  ratio  of  the  total  flow  f  in  arc  k,  versus  the 

““  t  ■*  »  %  *• 

contribution  f^  3  given  by  any  commodity  (i, j) .  Let  us  evalu¬ 
ate  the  average  of  this  ratio,  taken  over  all  arcs: 


average 


1  i 

b  k=i 


>  —  i  f 

-  hour  k 


*Many  other  appropriate  definitions  of  m  are  possible,  for  ex- 

l  r/  rii  , 

ample  m  -  £(2  -  >  in  which  case  m  =  0  corresponds 

to  the  uniform  traffic  requirement. 
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It  was  shown  by  Kleinrock  [1]  that: 

L 

l  f v  “  r(n  -  l)n 
k»l 


where : 


-  A 

P  = 


(k  Ii)pn)/lriy 


and  p. .  is  the  number  of  arcs  in 
13 


>ij  -  '/i 

(i,j)  route,  relative  to  the  routing  assignment  under  considera 
tion;  p  is  therefore  the  average  path  ler.  jth.  * 

Equation  (7.2)  Becomes: 


average 


/  fk\ 

Luj)] 

\  k  / 


>  (n  ~  l)n«p  >  (n  -  l)p 
bm  —  Km 


1/n 


(7.3) 


From  (7.3)  the  following  property  holds: 

Property  (7.1):  In  a  large  and  balanced  net,  on  the  average, 
the  contribution  of  one  single  cormodity  in  any  arc  can  be  con¬ 
sidered  infinitesimal,  as  compared  to  the  total  flow  in  that 
arc. 

In  order  :o  show  how  the  FD  method  applies  to  the  non- 
bifurcated  solution  of  large  and  Balanced  nets,  let  us  consider 
a  new  version  of  flow  deviation,  defined  as  the  composition  of 
deviations  involving  only  one  commodity  at  a  time.  Suppose 
that  the  flow  f  is  non-bifurcated;  that  commodity  (i,j)  flows 


ij' 

usual  metric 


I 

'ij 


on  tt.,;  and  that  tt " is  the  shortest  (i,j)  route,  vnder  the 


The  FD  method  dr.iates  a  proper  amount 


A*r„,  (0  £  A  1)  ,  of  (i,j)  commodity  from  n\ ^  to  such 

that  the  performance  T(A) : 


T (A)  =  T (f  (1  -  A)  +  vA) 


where :  f  contains 


v  contains 


^3 

■ 

7T.  . 

13 


T(A)  -  T  (0)  +  A 


b 

I 

k=l 


Ak(vk  "  fk}  +  °U(Y  -  V  J 


(7.4) 


is  minimized.  We  can  rewrite  Equation  (7.4)  as  follows: 


(7.5) 


* Notice  that  P  depends  on  the  particular  routing  assignment, 
while  p’  depends  on  requirement  matrix  and  topology  only;  also 
notice  that  p  >  p'. 
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where  0(  )  contains  the  terms  of  order  higher  than  1.  Due  t<~ 
Property  (7.1) ,  the  terms  (v^  -  f^)  can  be  considered  as  in¬ 
finitesimal,  and  the  term  0{  )  is  infinitesimal  of  order  higher 
than  1.  Therefore,  as  long  as  0,  defined  as: 


is  sufficiently  negative,  the  term  0(  )  can  be  disregarded  and 

the  minimizer  of  T(X)  in  Equation  (7.5)  is  at  the  boundary 

(X  .  =1) t  hence  the  FD  method  preserves  the  non-bi furcated 

min 

characteristic  of  the  flow.  On  the  other  hand,  if  0  vanishes, 

the  higher  order  terms  become  important  and  it  mi^ht  happe’, 

that  X  .  <1;  however,  0=0  implies  that  f  is  very  close  U« 

min  - 

optimum  (see  Appendix  for  bounds  on  the  error).  Therefore,  the 

FD  method  provides  non-bi f’  “cated  solutions  which  are  very  good 

approximations  to  the  optimu "  bifurcated  solution,  and,  as  a 

consequence,  very  good  approximations  also  to  the  optimum  non- 

bifurcated  solution. 

The  non-bifurcated  FD  algorithm  is  next  introduced: 

Non-Bi furcated  FD  Algorithm 

Let  f°  be  a  starting  feasible  non-bifurcated  flow.* 

Let  n  =  0. 

1.  Compute  SR(fn) ,  defined  as  the  set  of  shortest  routes  under 
metric  {i^}. 

2.  Let  g  =  fn. 

For  each  commodity  (i,j): 

2. a  Let  v  be  the  flow  configuration  obtained  from  g  by 
deviating  commodity  (i,j)  to  the  shortest  route  irj, 
n 

given  by  SR(f  ) . 

2.b  If  [v  feasible  and  T(v)  <  T(g)],  go  to  2.c.  Other¬ 
wise,  go  to  2.d. 

2.c  g  =  v 

2.d  If  all  commodities  (i,j)  have  been  processed,  go  to  3. 
Otherwise,  go  to  2. a. 

3.  If  g  =  fn,  stop.  The  FD  method  cannot  improve  the  non- 

n+i 

bifurcated  solution  any  further.  Otherwise,  let  f  =  g, 
n  =  n  +  1  and  go  to  1. 

*Such  a  starting  flow  can  be  found  with  a  Phase  1  procedure , 
similar  to  that  described  in  Section  6, 
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The  algorithm  converges  in  a  finite  number  of  steps,  as  there 
are  only  a  finite  number  of  non-bi furcated  flows,  and  repeti¬ 
tions  of  the  same  flow  are  excluded  by  the  stopping  condition. 

An  application  of  the  algorithm  to  a  luge  and  balance! 
net  is  presented  in  the  application  section. 

8.  THE  ROOTING  AND  CAPACITIES  ASSIGNMENT 


It  wa3  shown  in  Section  2,  that  F0j  the  feasible  set  for 

Problem  B,  is  a  convex  polyhedron;  it  was  also  shown  that  the 
additional  constraint  is  included  in  the  performance  T(f)  as 
penalty  function,  so  that  Problem  B  can  be  regarded  as  an  un¬ 
constrained  m.c.  flow  problem. 

Let  us  now  investigate  the  properties  of  T(f).  Recall 
(see  Equation  2.5): 


T(f) 


<D  ■  L f a) 


1  f-p‘ 


(8.1) 


Kleinrock,  in  (1] ,  considered  this  case  and  also  dealt  exten¬ 
sively  with  a  simplified  version  of  Equation  (8.1)*  He  showed 

1  2 

that,  whenever  two  routes,  say  it.,  and  it..,  with  the  same  num- 

13  13 

ber  of  intermediate  arcs,  are  available  for  commodity  (i,j)  , 
then  T (f )  is  minimized  when  the  entire  commodity  is  routed  on 
one  of  the  two  routes  only.  Such  a  result,  obtained  under  re¬ 
strictive  assumptions,  suggests  the  conjecture  that  the  optimal 
flow  be,  in  general,  non-bifurcated.  In  fact,  further  research 
has  been  done  [21] ,  [22]  ,  and  it  can  be  shown  that  T(f)  in 
in  Equation  (8.1)  is  quasi- concave  on  F  ,  i.e.,  given  any  two 

12  ® 
feasible  flows  f  and  f  [23] : 

T (f1)  <  T(f2)  =>  TU1)  <  T  [  ( 1  -  A)  f1  +  \f2] 


where : 


A  <  1. 


More  gene  ally,  T(f)  can  be  shown  to  be  quasi-concave  for  all 
"routing  and  capacities  assignment"  problems  with  concave  cost- 
capacity  functions  [21];  the  linear  case  is  therefore  a  special 
case. 


* Essentially ,  d.  -  1  and  pi  =  0,  vi. 

I' 


J 
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As  a  consequence  of  such  a  property,  the  local  minima  are 

at  extreme  points  of  F  .  i.e.,  they  correspond  to  shortest 

B 

route  flows  (see  Section  3) ,  which  are  a  subclass  of  the  class 
of  non-bifurcated  flows. 

The  FD  method,  when  applied  to  Problem  B,  can  be  greatly 
simplified:  the  step  size  X  is  always  equal  to  1  (if  we  find 
a  downhill  direction,  we  go  all  the  way  down,  due  to  the  quasi¬ 
concavity  of  T(A)),  ant.  the  flow  patterns  generated  are  com¬ 
pletely  defined  by  just  one  (n  x  r)  matrix,  the  shortest  route 
matrix. 

A  schematic  description  of  the  ?D  algorithm,  as  applied 
to  Problem  t,  is  as  follows: 

0  r- 

0.  Suppose*  f  e  r^;  let  n  *  0. 

1.  Let  fn+at  a  pd®  f”, 

2.  If  (Vif*1)  >_  T ( f ") )  ,  stop;  f^  local  minimum.  Otherwise 
let  n  «  n  +  1  and  go  to  1. 

The  convergence  of  the  algorithm  is  guaranteed  by  the  fact  that 
there  are  only  a  finite  number  of  shortest  route  flows,  a:  J  repe¬ 
titions  of  the  same  flow  are  not  possible,  as  Tff11)  is  strictly 
decreasing. 

The  partial  derivatives,  used  for  the  shortest  route  compu¬ 
tation,  have  the  following  expression: 


Notice  that  -rj—  0;  negative  loops  cannot  exist.  *  ’  ~o  notice 
that: 


lim 
f  .-+-0 


3T 

3f . 


00 


which  means  that,  whenever  the  flow  (and  therefore  the  capacity, 
from  Equation  (2.4))  of  an  arc  is  reduced  to  zero  at  the  end  of 


*The  problem  of  finding  a  feasible  starting  flow  is  discussed 
later  in  the  section. 
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an  FD  iteration,  then  in  such  an  arc,  the  flow  and  capacity  are 
zero  for  all  subsequent  iterations,  as  the  incremental  cost  of 
restoring  the  flow  (=  3T/3fj)  is  infinity.* 


Fig.  2  Block  diagram  of  the  FD  algorithm  fo:*  Problem  B. 


*This  -property  suggests  a  method  for  the  design  of  the  topology: 
we  can  start  from  a  topology  which  is  highly  connected,  and 
eliminate  arcs  with  the  FD  method,  until  a  suboptimal  configura¬ 
tion  is  obtained  121].  A  similar  approach  is  used  by  Yaged  in 
[18). 
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The  FD  method  leads  to  a  local  minimum,  which  depends  on 
the  choice  of  the  feasible  starting  flow.  In  order  to  find 
several  local  minima,  a  mechanism  that  produces  a  large  variety 
of  feasible  flows  is  required.  We  propose  the  following  ran¬ 
domized  procedure  for  the  generation  of  feasible  flows:* 

1.  Assign  initial  equivalent  lengths  {£?}  to  the  arcs  at  random. 

2.  Compute  the  shortest  route  flow  f®  according  to  the  metric 

{£?}- 

b 

r*  Q  t 

3.  If  D  -  ^  f.d.  >0,  f  is  feasible  and  can  be  used  to  start 

i=l  1  1 

the  FD  algorithm.  Otherwise  f®  is  rejected. 

The  initial  random  choice  of  the  lengths  guarantees  a  cer¬ 
tain  randomness  in  the  starting  feasible  flow,  thus  providing  a 
method  for  finding  several  local  minima.  After  a  convenient 
number  of  iterations,  the  global  minimum  is  chosen  as  the  mini¬ 
mum  of  the  local  minima.  This  provides  a  "suboptimal"  solution. 

A  block  diagram  of  the  method  is  given  in  Figure  2. 


SRI  UTAH  NCAR  AWS  CASE  CMU  MITRE 


Fig.  3  A  21-node  ARPA  topology. 

9.  APPLICATIONS 

As  an  application  of  the  FD  method,  Problems  A  and  B  are 
solved  for  the  ARPA  Computer  Network.  The  ARPA  Computer  Network 
is  a  S/F  communication  network  connecting  several  computer 


*Another  procedure  was  proposed  by  Yaged  [18). 
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facilities  in  the  United  States.  A  detailed  description  of  the 
network  is  given  in  [3]  -  [8],  [25]  -  [29].  Due  to  the  fact 
that  new  computer  centers  are  continually  joining  the  network, 
its  topology  has  been  changing  quite  rapidly;  in  these  applica¬ 
tions  we  refer  to  one  of  the  earlier  proposed  topologies,  with 
21  nodes  connected  by  26  full  duplex  channels  (see  Figure  3) . 

We  also  assume  that  the  traffic  requirement,  is  uniform  between 
all  pairs  of  nodes. 

9. 1  ARPA  Network :  The  Routing  Assignment 

The  traffic  requirement  R  =  (r„)  is  assumed  uniform: 

r  =  1.187  [kbits. /sec. ]*  for  1  /  j 

0  for  i  =  j 

First,  we  show  that,  for  the  21  node  ARPA  net  with  uniform 
requirement,  the  "large  and  balanced  net"  condition  holds.  From 
Equation  (7.1)  ,  n  is  given  by: 

mb 

n  - - IT 

n(n  -  l)p 

In  the  present  case: 


n  = 

21 

—  • 

P  > 

1 

b  = 

52 

(each  full  duplex  channel 
of  directed  arcs:  hence 

represents  a  pair 
26  x  2  =  52). 

Hence: 

n  <  0.12  «  1 

The  condition  is  satisfied.  We  can  therefore  apply  both  optimal 
and  non-bifurcated  FD  algorithms  and  compare  the  results. 

The  result  of  the  optimal  FD  algorithm  is:  =  0.2406 

sec.,  obtained  after  80  shortest  route  computations,  with  an 
-4 

accuracy  of  10  on  T.  The  result  of  the  non-bifurcated  FD 

algorithm  is:  T  .  =  0.2438  sec.,  obtained  after  12  shortest 

min 

path  computations.  The  algorithms  were  programmed  in  Fortran 
and  run  on  an  IBM  360/91;  the  execution  time  was  30  sec.  for 


*The  traffic  requirement  at  saturation  is  r'sa^  ~  1-250 
[kbits. /sec.]  (see  Figure  4).  He  chose  r  =  0.95  ?saj.  =  2-187 
in  order  to  have  a  feasible ,  but  difficult ,  requirement. 
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the  optimal  algorithm  and  4  sec.  for  the  non-bifurcated  one.* 
The  error  of  the  suboptimal  non-bifurcated  solution,  with  re¬ 
spect  to  the  optimum,  is  less  than  2  percent;  the  fact  shows 
how  powerful  the  non-bifurcated  algorithm  is  for  large  and 
balanced  nets,  and  suggests  that  a  convenient  modification  of 
it  could  be  useful  for  the  solution  of  very  large  nets  [21]. 


Fig.  4  Average  delay  T  versus  normalized  traffic  RE, 
using  various  routing  schemas. 

Figure  4  illustrates  the  application  of  the  non-bifurcated 
algorithm.  Recall  that  RE  is  the  traffic  level  normalized  to 
r  =  1.187  kbits. /sec.  The  traffic  is  first  routed  along  the 
shortest  routes  computed  for  REQ  =  0;  curve  CQ  plots  the  delay 

T  versus  RE,  using  such  a  routing  scheme  (which  we  refer  to  as 
RSq) .  With  RSg,  the  saturation  level  for  the  traffic  is 

RESATq  =  .85  <  1;  RE  =  1  is  infeasible,  and  therefore  we  are 

still  in  Phase  1.  Let  f*  be  the  flow  obtained  by  routing  traf¬ 
fic  level  RE^  =  .95  RESAT ^  =  .8,  according  to  RS^  and  apply  to 

f^  the  FD  algorithm;  a  new  routing  scheme  RS^  is  obtained,  which 

improves  T(RE^) .  Curve  corresponding  to  RS^  saturates  at 


*We  expect  to  be  able  to  reduce  considerably  the  computation 
time  by  optimizing  the  code  and  by  improving  some  hard  working 
subroutines ,  like  the  shortest  route  and  flow  assignment  rou¬ 
tines  [25]. 
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RESAT ^  =  1.05  >  1;  RE  =  1  is  feasible  and  Phase  2  is  initiated, 

with  RE 2  3  1.  At  the  end  of  Phase  2,  the  sub-optimal,  non- 

bifurcated  routing  scheme  RS2  is  found;  curve  C  correspfV»ding 

to  RS2  practically  coincides  with  curve  in  Figure  4,  as  the 

scale  of  T  is  not  detailed  enough  to  show  differences  in  values. 
Notice  that,  as  expected,  the  routing  RSQ  gives  the  best  results 

at  low  traffic  levels;  in  fact,  RS  is  almost  optimal  up  to 
RE  =  0.5.  0 

9.2  ARPA  Network:  Routing  and  Capacities  Assignment 

The  set  of  channel  capacities  available  for  the  ARPA  Net¬ 
work  is  discrete:  Table  1  contains  the  list  of  capacity  options 
and  corresponding  costs  considered  in  the  present  application 
i 6] .  In  order  to  be  able  to  apply  the  FD  method,  the  discrete 
cost-capacity  curves  have  been  approximated  with  continuous, 
piece-wise  linear  curves  (see  Figure  5).  We  do  not  discuss  the 
details  of  the  approximation,  but  merely  mention  that  they  must 
be  concave.*  The  concavity  of  the  cost-capacity  curves  implies 
that  the  local  minima  are  shortest  route  flows  (see  Section  8) . 
The  FD  method  can,  therefore,  be  applied  in  a  form  similar  to 
the  one  presented  in  Section  8;  a  few  modifications  are  required 
due  to  the  ncn-linearity  of  the  cost-capacity  curves. 


CHANNEL  CAPACITIES  AND  CORRESPONDING 
COSTS  USED  IN  THE  OPTIMIZATION 


Capacity 

Termination  Cost 

Line  Cost 

[kbits/sec] 

[$/mcnth] 

[$/month/mile] 

7.2 

810 

.35 

19.2 

850 

2.10 

50 

850 

4.20 

108 

2400 

4.20 

230.4 

1300 

21.00 

Table  1 


Note:  The  total  cost  per  month  of  a  channel  is  given  by: 

total  cost  =  termination  cost  +  (line  cost)  x  (length 
in  miles) . 


*0ther  concave  approximations  can  be  considered:  see  [6],  [2#] . 
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d':  staircase  corresponding  to  discrete  capacity  levels, 
d" :  piece-wise  linear  approximation 

Fig.  5  Cost-capacity  curves  for  arc  i. 


A  schematic  description  of  the  algorithm  follows  here: 

0.  Let  Dg  be  the  total  dollar  investment. 

Let  f°  e  Fb 

0  0  * 
Let  C  be  the  optimal  capacities  assignment  for  fixed  f  . 

Let  Tq ( f)  be  as  from  Equation  (8.1),  using  linear  approxi¬ 
mations  of  the  cost-capacity  curves  around  C°. 

Let  n  =  0. 


Let: 

fn+1  =  shortest  rout-?  flow  computed  at  fn 

(using  metric  Hk  =  OT^JfJ/af.^]  f=fn)  • 

n+ 1 

be  the  optimal  capacities  assignment  for  fixed 

and  let  T  ,(f)  be  as  from  Equation  (8.1),  using 
~  n+1  ~ 

linear  approximations  of  the  cost-capacity  curves  around 

cn+1. 

If  I  T 


Let  C 
.n+1 


...n+1.  _  .  _n. \  ,  ,n  ,  ,  ... 

. (f  )  >  T  (f  )  i,  stop;  f  is  a  local  minimum. 
n+1  -  —  n  ~  / 


Otherwise,  let  n  =  n  +  1  and  go  to  1. 


*The  optimal  assignment  of  capacities ,  given  the  flows  and  the 
total  dollar  investment ,  for  concave  cost- capacity  functions, 
has  been  discussed  by  J'.l^inrock  [6). 
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The  result  of  the  above  described  algorichm  is  a  local 
minimum  for  the  continuous  cost-capacity  problem.  In  order  to 
get  a  solution  for  the  discrete  problem,  the  capacities  and 
flows  given  by  the  algorithm  are  "adjusted"  in  the  following 
manner:  in  all  arcs,  the  capacity  is  increased  to  the  upper 
value  of  discrete  capacity  available  (thus  increasing  the  total 
investment  to  D  >  D^) ;  then,  the  routing  is  optimized  once  again 

with  the  FD  routing  algorithm. 

The  above  described  technique  is  clearly  suboptimal.  We 
cannot  guarantee  that  the  solutions  so  found  are  local  minima; 
in  fact,  it  is  not  even  possible  to  define  a  local  minimum  in 
a  discrete  problem.  Other  suboptimal  techniques  have  been  pro¬ 
posed  (7,10,21];  however,  the  optimization  of  a  network  with 
discrete  capacities  still  remains  a  formidable  (and  basically 
unsolved)  problem.* 


O  r-IOOS  (BITS/SEC  i  NODE  PAIR) 
•  r*925  (BITS/SEC  »  NODE  PAIR) 
A  r  *846  (BITS/SEC  »  NODE  PAIR) 
a  r  *771  (BITS/SEC  l  NODE  PAIR) 
Dg  •  COST  Of  ALL  50  K  BITS  NET 
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°0 


I  ‘  ■ _ i _ < - 1 _ I _ 4 _ I 

73  80 

0(K  I/MONTH) 


Fig.  6  Delay  T  versus  cost  D  of  various  undominated  capacity 
assignments  for  different  traffic  levels. 


*The  optimum  solution  can  be  obtained ,  with  dynamic  programming 
techniques ,  in  the  special  case  of  a  centralized  network  [30]. 
In  fact,  for  such  a  case ,  the  problem  reduces  to  the  optimal 
assignment  of  capacities  only,  as  the  flows  are  already  deter- 
rrm’^ed  by  the  tree-structure  topology. 
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The  technique  has  been  applied  to  the  design  of  the  ARPA 
Network.  Four  cases  have  been  run,  each  with  a  different  value 
of  uniform  requirement  r  (see  Figure  5) .  The  initial  cost  DQ 

was  made  equal  to  the  cost  of  the  proposed  network  with  all  50 
kbit  channels  (DQ  -  71,000  $ /month) .  In  order  to  be  able  to 

compare  the  50  kbit  capacities  assignment  to  the  assignments 
found  with  the  FD  method,  the  minimum  xay  T,  with  all  50  kbit 
capacities  (i.e.,  with  total  cost  D  =  D^) ,  was  reported  on  the 

graph  for  each  value  of  r  (T  was  obtained  from  the  curves  in 
Figure  4) .  The  delay  T  and  the  total  cost  D  of  the  undominated* 
solutions  are  plotted  in  the  graph  of  Figure  6. 

10.  CONCLUSION 

The  FD  method  can  be  applied  to  -uiy  unconstrained,  m.c. 
flow  problem  when  some  reasonable  assumptions  on  P(f)  are  satis¬ 
fied.  It  also  can  be  applied  to  constrained  flow  problems:  in 
particular  to  problems  that  include  the  constraints  as  penalties 
in  P(f),  or  that  have  been  decomposed  with  the  Dar.tzig-Wolfe 
method.  Local  minima  are  in  general  attained;  for  convex  prob¬ 
lems,  the  global  minimum  is  found. 

The  FD  method  seems  to  be  an  efficient  tool  for  the  design 
of  S/F  networks:  for  example,  if  we  consider  the  optimal  rout¬ 
ing  problem,  it  can  be  shown  [19]  that  the  amount  of  computation 
per  iteration  required  by  the  FD  method  is  comparable  to  that 
of  the  heuristic  techniques  so  far  proposed  [16,24].+  A  general 
statement,  however,  about  the  effectiveness  of  the  FD  method  as 
compared  to  other  methods  would  not  be  appropriate:  many  fac¬ 
tors,  which  depend  on  the  specific  application  (like  trade-off 
between  precision  and  computational  speed)  should  be  considered 
in  order  to  select  the  proper  approach. 

APPENDIX:  CASE  OF  P(f)  STRICTLY  CONVEX 

If  P(f)  is  strictly  convex,  a  direct  proof  of  convergence 
of  the  FD  algorithm,  defined  in  Section  5,  is  available  and  a 
lower  bound  can  be  established. 

*A  solution  (T.}D.)  is  said  to  he  dominated  by  (T  .,D ,)  if: 

(D .  <  D.)  and  (T .  <  T .) 

3  v  ,7  t- 

A  solution  is  undominated  if  it  is  not  dominated  by  any  other 
solution. 

+ The  two  bottlenecks ,  common  to  both  approaches,  are  the  short¬ 
est  route  computation  and  the  flow  assignment  [16]. 
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Convergence 

He  want  to  show  that: 


lim  fn  «  f* 
!>♦<» 


(A.l) 


where  f*  is  the  global  minimum  of  P(f)  on  F,  and  {f11}  is  the 
sequence  generated  by  recursive  application  of  the  FD  operator 

on  a  given  starting  flow  f°.  The  associated  sequence  (Pff”)} 

is  monotonically  non-increasing  and  lower  bounded  fay  P*  ^  P(f*) , 
therefore  it  must  converge: 


lim  P(f°)  =  P'  >  P* 
n 


Also,  recalling  that: 


(A.  2) 


where 


P(f")  -  P'  =  l  AP(f*) 
l=n 


o  a  i.  i.  I  ?+l 

AP(f*)  =  P(fX)  -  P(FD  ©  fX)  =  P(fX)  -  P(fX  ) 


and  recalling  that: 

AP(f£)  >  0  V£ 

we  have,  from  Equation  (A. 2): 

lim  APff”)  =  0 
n~K= 


(A.  3) 


Suppose  (A.l)  is  false?  this  implies,  since  P(f)  is  strictly 
convex,  that  P'  >  P*.  However,  in  such  a  case^  we  are  able  to 
establish  a  relation  which  contradicts  Equation  (A. 3)  as  fol¬ 
lows. 

Let  us  first  establish  a  lower  bound  on  AP(f).  Let: 


P(A)  =  P((l  -  A)  f  +  Ay],  0  A  <  1 

where:  v  is  the  shortest  route  flow  computed  at  f. 
Taylor's  expansion: 


Using 


P(\)  =  P(0)  +  a[~]  +  -  A2  ^-|1 

LdAjA=0  2  LdA  JA=£ 


(A.  4) 
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where  t,  is  a  proper  value  in  the  interval  (0,X)  as  usual.  By 
assumption,  the  second  partial  derivatives  of  P(f)  are  upper 
bounded;  therefore,  the  second  directional  derivative  is  also 
upper  bounded,  and  Equation  (A. 4)  becomes: 


P (A)  -  P(0)  <  X  0  +  -  X2M 

2 


where 


(A.  5) 


9=  l 


(A. 5) • 


2  2 

M:  upperbound  on  d  P/dX  . 

After  minimizing  both  sides  of  Equation  (A. 5)  over  X,  and  re¬ 
calling  that  min  [P(X)  -  P(0)]  =  -AP(f) ,  we  get: 


AP(f)  > 


0/2M  if  -0/M  <  1 
M/2  if  -6/M  >  1 


Equation  (A. 6)  can  be  rewritten  as  follows: 

M  fl2 

AP(f)  >  ^min  ~,1 
2  M 


(A  6) 


Inequality  (A. 6)'  represents  a  useful  lower  bound  on  AP(f). 
Consider  now: 

P(X)  =  P[(l  -  x)fn  +  X-f*] 
where:  0  <_  X  <_  1 

P(X)  is  strictly  convex,  therefore  it  lies  above  its  tangent 
line  at  X  =  0: 


P (X)  >  P(f  )  +  X 


where:  l,  = 
k 


ll  +  xf  f  t.(f *  -  f") 

[kii  k  *  *J 

'  Kl?" 


Letting  X  *  1  in  {A. 7)  and  recalling  from  (A. 2)  that  P(jn)  >.  P' 


P(f*>  =  P*  >  P'  +  l  tk(fj  -  f") 
k=l 


* Notice  that  M  >  0  as  P(\)  is  strictly  convex. 
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Let  v  be  the  shortest  route  flow  computed  at  f  ;  we  have,  from 
Equation  (A. 8) : 

b 

P*  >  p'  +  l  £,k(vk  -  f£)  (A. 9) 


From  (A. 9) ,  using  definition  (A. 5)’,  we  have: 
p'  -  p*  <_  | e | 

Introducing  (A. 10)  into  (A. 6)'  we  get: 


,  ,n  M 
AP(f  )  >  —  min 

*“  —  O 


(p‘  -  P*)2 


,  1  >  0 


(A. 10) 


(A. 11) 


The  r.h.s.  of  Equation  (A. 11)  is  independent  of  n  and  strictly 
positive,  therefore: 


lim  AP(f  )  >  0 


(A. 12) 


Equation  (A. 12)  contradicts  Equation  (A, 3).  Therefore  (A.l)  is 
true. 

Lower  Bound 

By  replacing  fn  with  a  generic  f  c  F  in  (A. 7) ,  and  letting 
\  =  1,  we  get,  after  a  few  steps: 

b 

P(f*)  >  P(f)  +  l  £k(vk  -  fR)  (A. 13) 

k=l 

where:  f*  is  the  global  minimum 

v  is  the  shortest  route 
flow  computed  at  f 

From  (A. 13)  ,  lower  and  upper  bounds  on  P(f*)  are  readily  avail¬ 
able  : 

b 

P(f)  >  P(f*)  >  P(f)  +  l  ik(vk  -  fR) 


b 

Notice  that  the  test  for  optimality  based  on  £  £  (v  ~  ^k^ 

k=l 

(see  Section  5)  is  very  powerful  in  the  case  of  P(f)  strictly 
convex,  as  it  provides  an  upper  bound  on  the  optimal  value 
error. 
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ABSTRACT 

A  AutiXching  network  may  be  informally  described  as  a  col¬ 
lection  of  single-pole ,  single-throw  switches  arranged  so  as 
to  connect  a  set  of  terminals  called  Inputi  to  another  set  of 
terminals  called  outpatb.  It  is  non-blocking  if,  given  any  set 
of  connections  from  some  of  the  inputs  to  some  of  the  outputs, 
and  an  idle  input  terminal  x  and  idle  output  terminal  y,  then 
it  is  possible  to  connect  x  to  y  without  disturbing  any  of  the 
existing  connections.  Denote  by  o(a,b)  the  minimal  number  of 
switches  necessary  to  connect  a  inputs  to  b  outputs  using  a 
non-blocking  network.  We  are  interested  in  studying  the  growth 
of  a(a,a)  as  a  ■*  Re&ilts  of  C.  Clos  show  that  a(a,a)  <_ 

q  ae?^°Q  a’l°9  % \  ye  gfou  ffaj.  a(ata)  Badog^o)2 . 

1.  INTRODUCTION 

A  network  N  consists  of  a  graph  G;  two  sets  of  vertices  of 
G,  denoted  A  and  B  and  called,  respectively,  the  (sets  of)  in¬ 
puts  and  outputs ;  and  a  set  P  of  paths  of  G.  Each  path  in  P 
connects  an  input  to  an  output  c;nd  meets  no  other  inputs  or 
outputs.  We  write  N  =  (G,A,B,P).  A  state  of  N  is  a  subset  S 
•  of  P  such  that  no  two  paths  in  S  have  a  common  vertex.  A  state 

S  defines  a  bijection  f  from  a  subset  of  A  to  a  subset  of  B  as 

t  b 

.  follows:  Suppose  p  e  S  and  p  connects  x  z  A  to  y  e  B;  put 
fg(x)  =  y,  and  repeat  this  for  each  path  in  S.  We  shall  say 

that  a  path  p  of  G  is  admissible  if  p  e  P.  If  x  is  a  vertex  of 

G  we  shall  say  that  x  is  busy  (in  the  state  S)  if  x  lies  on  a 

path  p  z  S;  otherwise  we  shall  say  that  x  is  idle  (in  the  state 
S) .  If  x  is  an  input  of  G  and  y  is  an  output  of  G,  we  shall 
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say  that  x  has  access  to  y  (in  the  state  S)  if  there  exists  a 
path  p  e  P  connecting  x  to  •  uid  such  that  S  U  (p)  is  a  state. 

A  network  N  «=  (G,A,B,P.  may  be  interpreted  as  a  switching 
device;  under  this  interpretation,  the  elements  of  A  are  con¬ 
sidered  as  input  terminals,  the  elements  of  B  are  considered  as 
output  terminals,  and  the  edges  of  G  are  considered  as  single¬ 
pole,  single-throw  switches  which  are  normally  open.  Then  a 
path  p,  which  connects  x  e  A  to  y  e  B  may  be  thought  of  as  a 
sequence  of  switches  which,  when  closed,  connect  x  to  y.  The 
state  S  yields  a  collection  of  switches  (all  edges  on  any  path 
in  S)  which,  when  closed,  connect  inputs  to  outputs  in  the  man¬ 
ner  described  by  the  function  fg. 

The  network  N  =  (G,A,B,P)  is  said  to  be  non-blocking  if 
given  any  state  S  of  N  and  idle  vertices  x  e  A,  y  e  B,  then  x 
has  access  to  y  in  the  state  S.  In  terms  of  the  switching  net¬ 
work  interpretation  mentioned  above,  this  means  that  if  x  and  y 
are  idle  input  and  output  terminals,  respectively,  then  it  is 
possible  to  establish  a  connection  between  them  without  dis¬ 
turbing  the  existing  connections. 

Fran  now  on,  all  the  networks  we  study  will  have  disjoint 
inputs  and  outputs  (i.e.  A  f)  B  =  0) . 

Given  positive  integers  a  and  b  we  are  interested  in  find¬ 
ing  those  ncn-blocking  networks  N  =  (G,A,B,P)  with  |a|  =  a, 

| B I  =  b  for  which  the  number  of  edges  of  G  is  minimal.  We  shall 
denote  this  number  by  o(a,b).  In  terms  of  switching  networks, 
this  amount  to  finding  non-blocking  networks  using  a  minimal 
number  of  switches.  An  obvious  non-blocking  network  with  a  in¬ 
puts  and  b  outputs  is  the  network  whose  graph  is  the  complete 
bipartite  graph  on  vertex  sets  A  and  B  with  |a|  =  a  and  | 3 |  =  b. 
In  this  graph  the  set  of  vertices  is  A  U  B  and  there  is  an  edge 
connecting  each  vertex  in  A  to  each  vertex  in  B.  The  set  P  con¬ 
sists  of  all  paths  consisting  of  exactly  one  edge.  Thus  P  has 
ab  elements.  In  the  switching  network  interpretation,  this 
amounts  to  an  a  by  b  crossbar  switch.  When  the  names  of  the 
sets  A  and  B  are  unimportant ,  we  shall  denote  this  network  by 

C  .  .  The  network  C  .  shows  that  o(a,b)  <  ab. 
ab  ab  —  2 

It  was  Clos  [2]  who  showed  that  c(N,N)  <  N  for  all  large 
N.  His  methods,  which  will  be  described  later,  show  that  o(N,N) 

<_  C  Ne^^°^  2)^  We  show  that  a(N,N)  <_  8N(log2  N)  2 

We  do  not  attempt  to  obtain  the  smallest  possible  constant  mul¬ 
tiplier,  for  it  is  not  clear  that  the  exponent  2  can  not  be  re¬ 
duced.  In  the  opposite  direction,  an  elementary  argument  shows 
that  a  (N,N)  >  C  N  log2  and  nothing  stronger  is  known. 
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cussions  with  Professors  B.  Gordon  and  C.  B.  Tompkins. 

2.  CONSTRUCTIONS 

We  shall  say  that  networks  N  =  (G,A,B,P)  and  N^  =  (G^,A  , 

Bl'Pl^  * re  iBcmorPhic  (or  equivalent)  if  there  exists  a  graph 

isomorphism  y  of  G  onto  G^  such  that  y(A)  -  A  ,  y(B)  =  B^,  and 

P (P)  =  P  .  It  is  clear  that  the  property  of  being  non-blocking 

is  preserved  under  isomorphism. 

If  N  =  (G,A,B,P)  is  a  network,  we  define  its  transpose  N' 
to  be  the  network  N*  =  (G,B,A,P);  clearly  N"  =  N. 

If  G  is  a  graph  and  C  is  a  set,  we  define  the  graph  G  x  C 
to  be  the  graph  whose  vertices  are  the  ordered  pairs  (x,c)  with 
x  a  vertex  of  G  and  c  e  C;  ((x^jc^),  (x2*c2^  is  an  edge  of 

G  x  c  if  Cj  =  c2  and  (x^x^  is  an  edge  of  G.  If  p  is  a  path 

in  G  whose  vertices,  in  order,  are  x  ,  x  ,  . ..,  x  then  by 

p  x  c  we  mean  the  path  in  G  x  C  whose  vertices  are  (xQ,c) , 

(x^c) ,  . ..,  (x^.c) .  The  product  C  x  G  is  defined  similarly. 

Now  suppose  L^  =  (G^,A^,B^,P^)  (i  =  1  or  2)  are  networks; 

we  are  going  to  define  the  network  product  L2-  We  shall 

denote  this  product  by  N  =  (H,C,D,Q) .  Put  C  =  A^  x  and  D  = 

B  x  B  .  The  graph  H  is  obtained  from  the  two  graphs  G  x  A 

^  X. 

and  x  G2  by  identifying  the  vertices  in  x  which  appear 

in  both  graphs.  All  admissible  paths  q  e  Q  of  N  are  obtained  as 
follows:  Let  p^  e  P^  be  an  admissible  path  connecting  x^  e  A^ 

to  y.  e  B.  (i  =  1  or  2).  Then  p^  x  x2  ends  in  the  vertex  (y^x,^ 

which  is  the  first  vertex  of  y^  x  P2-  The  path  q  =  (p^,P2>  is 

defined  to  be  the  path  obtained  from  the  paths  p^  x  x2  and 

^1  x  ^2  concatenating  them  and  identifying  the  common  vertex 

^yl,X2^'  Note  that  this  maps  P^  x  P2  onto  Q. 

In  the  switching  network  interpretation  this  construction 
amounts  to  taking  | |  copies  of  L^  and  j  j  copies  of  L2,  and 

connecting  the  outputs  of  each  of  the  copies  of  L^  to  the  inputs 
of  all  of  the  copies  of  L2  'see  Figure  1) . 
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The  lines  do  not  represent  edges;  they  connect 
output  vertices  of  to  the  input  vertices  of 

with  which  they  are  identified. 

Fig.  1  1^  x  I*2 


Let  a^,b^,c,d  denote,  respectively,  the  cardinalities  of 
A^,B^,C,D,  and  let  g^,h  denote,  respectively  the  number  of 


edges  of  and  H 


The  following  relationship  between  two  by 
two  matrices  is  easily  verified 


f1  °)f2  °w°  °v 

\91  V'g2  V  'h  d' 


(i) 


If  L^  is  isomorphic  to  and  L2  is  isomorphic  to^  M2  it  is 
easy  to  verify  that  L^  x  L2  is  isomorphic  to  x  M2*  Further¬ 
more  (L^  x  L^)  *  =  1*2  x  Lj.  Finally,  we  have  associativity: 

(L^  x  L2)  x  L.j  =  x  (L2  x  L^) ;  we  vill  usually  write  simply 
L1  X  L2  x  Ij3‘  We  abbreviate  the  k-fold  product  L  x  L  x 


k 

•••  x  L  by  L  . 

We  also  define  a  triple  product  of  the 

(G . , A . ,B . ,P . )  (i  =  1,2,3)  when  |b. |  =  |a,|. 
1111  13 

jection  from  A^  onto  B^;  the  triple  product 
upon  the  choice  of  x  and  will  be  denoted  by 


three  networks  L.  = 
1 

Let  x  be  a  bi- 
of  L^ ,L2 ,L^  depends 
[L ,L2 ,L^] ^ •  (In 


i 

1 


1 

i 

i 
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many  cases  will  be  Lj  and  in  such  cases  we  will  choose  t  to 

b?  the  identity  map.  In  any  case  those  properties  of  the  triple 
pioduct  which  we  will  use  will  be  independent  of  the  choice  of  t 
and  we  will  frequently  write  iL^L^L^l  instead  of  (L^  ,L_  ,L^]  . ) 

Suppose  then  that  N  *  (H,C,D,Q)  is  . 

We  put  C  =  A^  x  and  D  =  x  B^;  H  is  defined  as  the 

graph  obtained  from  the  three  graphs  x  A2’  ®1  x  G2'  and 

G3  x  B2<  by  identifying  B3  x  A2  in  G3  x  A2  with  B^  x  A2  in 

B3  *  G2<  and  by  identifying  A3  x  B2  in  G3  x  B2  with  t (A3>  x  B2  = 

Bj  x  B2  in  B^  x  G2.  The  admissible  paths  q  e  Q  are  obtained  in 

the  following  way:  Let  p^  be  an  admissible  path  of  connecting 

x.  e  A.  to  y.  £  B.  (i  =  1,2,3)  and  suppose  t(x3)  =  y  .  Then 

P1  X  X2  ends  at  ^yi'X2^  '  yi  x  P2  begins  at  (y1»*2)  and  ends  at 

(y1»y2);  and  P3  X  y2  begins  at  (x3,y2>  =  *yi'y2**  The  path  q  is 

obtained  by  concatenating  p^  x  x2#  y3  x  p2»  p3  x  y2  and  identi¬ 
fying  the  vertices  common  to  two  segments  of  q. 

Note  that  [L^ ,L2 ,L31  is,  in  general,  different  from 

Lj  x  L^  x  L3  (see  Figure  2) . 


The  lines  do  not  represent  edges;  instead  they 
connect  vertices  which  are  to  be  identified. 
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Theorem  2.1:  Suppose  L.  =  (G  .,A  .,B  .,P .)  (i  =  l, 2,3, 4,3)  are 
networks.  Suppose  x  is  a  bisection  of  A r  onto  Then 

ILjt  1^2*  ^3*^4^  x  ,^J3^x  =  ^2  X  ^ '^'3> x  ^'Yx* 

1  2 


where  x^  is  the  bisection  from  A ^  x  A ^  to  B ^  x  B ^  given  by 
x(a4ias)  =  (x2(as)tx^(a4)). 


3.  THE  CLOS  METHOD  AND  SOME  VARIATIONS 

The  basic  method,  due  to  Clos  [2]  and  quoted  by  Bex  es  [1] 
may  be  stated  as  the 

Theorem  (Clos):  Suppose  L  =  (G,A,B,P)  is  non-blocking  and 
s  >_  2r  -  1.  Then  N  =  ^rs*^t^8r^  i 8  non-blocking. 

This  is  a  special  case  of  the  following  more  general 

Theorem  3.1:  Suppose  L.  =  (G..A  .,B.,P .)  (i  =  1.2.3)  are  non- 

1>  %  %  7,  t- 

blocking,  that  \B.  |  >_  J -4^  |  +  [S^|  -  1,  and  that  |B^|  «  |>4  |. 

Then  N  =  lLltL0,L7]  is  non-blocking  for  any  bisection  x  of  A, 

T  i  &  «5  T  «5 

onto  B^. 

Proof:  Suppose  NT  =  (H,C, D,Q)  is  in  state  S,  and  that  x  e  C, 
y  t  D  are  idle.  We  must  show  there  exists  a  path  q  e  Q  con¬ 
necting  x  to  y  and  having  no  common  vertices  with  any  path  in  S. 

Suppose  x  =  (u.^,^)  e  x  Aj  and  y  =  t  B3  x  There 

are  |a^[  vertices  of  the  form  (u,u2)  e  A^  x  A 2  and  at  most 

|  A  [  -  1  of  thv.*n  are  busy.  Hence  at  most  |  A^  |  -  1  of  the  |  B^  | 

vertices  of  the  form  (y,u2)  e  B^  x  are  busy  and  hence  at 

least  jB.J  -  | A1|  +  1  of  them  are  idle.  Denote  these  vertices 

by  (y.  .u  My,  ,u  )  ,  ...,  (y,  ,u2)  ,  so  that  r  >_  | B  |  -  IaJ  +  1 

Similarly,  there  are  vertices  {z.  ,v  ),(z.  ,V  ) ,  ...,  (z.  ,v  ) 

X1  1  12  Xs 

in  A^  x  B2  which  are  idle,  and  s  >_  |a^|  -  | B^ |  +1.  The  r  +  s 

vertices  y.  ,y.  ,  ...,  y.  ,  x(z.  ),x(z.  ),  ...,  x(z.  )  all  lie 
11  X2  1r  *1  x2  Xs 
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in  B^  and 

r  +  s  1  |b1I  -  |ax|  +  1  +  |b1|  -  |b3|  +  1 
>  |b1|  +  1  +  <|bJ  -  |aJ  -  |b3|  -  1) 
1  IbJ  +  i. 


So  two  of  them  must  be  the  same.  Now  the  v.  cure  all  distinct 

l . 

3 

and  so  are  the  t(z.  ).  Thus  there  must  be  a  y.  equal  to  a 

t(z.  ),  say  y.  =  t(z  ).  Since  L  is  non-blocking  there  is  a 
Ak  11  31  1 

path  p^  connecting  u^  to  y^  and  such  that  p^  x  u2  has  no  conmon 

vertices  with  any  vertex  in  S.  Similarly  there  is  a  path  p2 
from  u2  to  v2  in  P2  such  that  y^  x  P2  has  no  common  vertices 

with  any  path  in  S,  and  there  is  a  path  p  from  z.  to  v  in  P 

j  3^  j  j 

such  that  p^  x  v2  has  no  vertex  in  conmon  with  any  path  in  S. 

Let  q  be  the  concatenation  of  p^  *  u2»  x  p2»  aiM*  P3  x  V2 

with  the  appropriate  vertices  identified.  Then  q  connects  y.  to 
y  and  S  U  (q)  is  a  state  of  N^ . 

Remark  3.2:  Suppose  a^  =  |a^| ,  b^  *  |b^|  and  g^  is  the  number 

of  edges  of  G^  {i  =  1,2,3) .  It  is  easy  to  verify  using  (1)  that 

N  *  [L^.L^L^]  has  a^a2  inputs,  b2b3  outPuts  and  that  its  graph 
has  a2g  +  b^  +  b2g3  edges. 


Clos  [2]  suggests  using  networks  which  may  be  described  as 
[L, [L, [L,  ...,  [L,M,L'3 ,L'] ,L'] ,  ...,  L'] 


where  L  =  C  _  ,  and  M  =  C  .  By  Theorem  2.1,  this  is  the 

n  f  n ,n 

same  as  [Lfc,M, (L1 ) fc} ,  where  Lt=LxLxLx  •••  x  L  (t  times). 

He  shows  that  this  non-blocking  network,  which  has  nt+1  inputs 
and  outputs,  has 


n  (2n  -  1) 
n  -  1 


[ (5n  -  3) (2n 


..H  ,  t. 

1)  -  2n  ] 


edges.  This  follows  immediately  from  the  above  remark.  It  is 
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easy  to  verify  that  a  non-blocking  network  with  N  inputs  and 
outputs,  constructed  by  this  method,  will  require  at  least 


C«e2/log  N‘log  2 


edges,  where 


>  0  is  a  constant. 


Suppose  that  L^  denotes  a  network  with  a  inputs,  b  out¬ 
puts,  aad  whose  graph  contains  a  minimal  number  of  edges,  namely 

c(a,b).  Using  two  copies  of  L  shows  that  a(a,2a)  <  2a(a,a). 

aa  ™ 

By  Theorem  3.1,  (L  ,L  ,L  ]  is  non-blocking  and  by  Remark 
a  /  ^a  a, a  ^a  f  a 

3.2,  it  has  <_  ao(a,2a)  +  2ao(a,a)  +  aa(2a,a)  <_  6ao(a,a)  edges. 
Thus 


2  2 

a  (a  ,a  )  <  6aa  (a,a)  .  (2) 

log  6 

Iteration  of  (2)  shows  that  a(N,N)  <_  C  N(log  N)  .  This  re¬ 
sult  can  be  improved  by  considering  [L  „  ,L  ,L_  ];  this 

a,^a  a §  2d  2a  ^ a 

network  has  ab  inputs,  2ab  outputs  and  its  graph  has  3bo(a,2a) 

+  2ao(b,2b)  edges.  This  shows  that 

o(ab,2ab)  <_  3ba(a,2a)  +  2aa(b,2b).  (3) 

Putting  a  *  b  and  iterating  (3)  shows  that 

log2  5 

o( a, 2a)  £C  a(log2  a)  and  since  a(a,a)  a(a,2a)  we  find 


that 

log  5 

o(N,N)  <_  C  N{log2  N) 


The  exponent  log2  5  can  be  decreased  by  choosing  a  and  b  dif¬ 
ferently.  Let  a  >  1  and  6  >  2  be  the  real  solutions  of  the 
simultaneous  equations 

e-i  ,  v 

“  ■  3  14) 

(a  -  l)8'1  -  3/2 > 

Numerical  computation  shows  that  o  -  2.37638  and  6  -  2.26922. 
Multiplying  the  second  equation  of  (4)  by  a  -  1  « nd  substituting 

g  B 

from  the  first  yields  2{a  -  1/  =  a  -  3  or  equivalently 

3(l/a)6  +  2(1  -  1/a)6  =  1.  (5) 

O 

We  now  show  that  if  y(x)  =  (log  x)p,  then  y  (x)  satisfies  the 
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functional  equation 

U  (z)  *  3p  (x)  +  2u  (y)  (6) 

where  x  =  z^a  and  y  =  z/x.  Indeed, 

3y(x)  +  2y(y)  =  3 ((log  z)/a)B  +  2 (log  z)8  (1  -  1/a) 8 

=  (log  z) 8 
*  p(z) 

using  (5) . 

Now  a(x,2x)/x  satisfies  a  functional  inequality  similar  to 
(6)  where  x  and  y  must  be  integers.  It  follows  that  for  each 
e  >  0,  the  exists  C^  >  0  spch  that 

a  |  p 

0 (N , 2N)  <_  C£  N(log  N)  . 

For  comparison,  log2  5  *=  2.32193. 

4.  THE  EXPONENT  IS  <_  2 

Suppose  L  -=  (G,A,B,P)  is  a  network  (not  necessarily  non- 
blocking)  .  We  shall  say  that  L  is  of  type  T(m,n)  if,  given  any 
state  S  of  L  and  m  idle  inputs  x, ,  x.,  ...,  x  of  L,  then  each 

12  IP. 

has  access,  in  the  state  S,  to  at  least  n  outputs  of  L. 

Lemma  4.1:  Suppose  L  =  (G,A, B3P)  is  of  type  T(m,m  +  n  -  1)  for 
1  <_  m  <_  k,  that  M  is  a  non-blocking  network  with  c  inputs  and  d 
outputs j  arid  that  nd  >_a(c  -  1) .  Then  L  x  M  is  of  type 
T(m3m  +  n'  -  1)  for  1  <_m  <_  k  where  n'  =  nd  -  a(c  -  1)  and  a  is 
the  number  of  inputs  of  L. 

Proof:  Take  k  <_  m  idle  inputs  z^,  z2 ,  ...,  z^.  Suppose,  for 
example,  that  z^,  z2,  ...,  z^,  are  of  the  form 

(*l'Yl) ' ^X2 9  *  *  *  9  | ,y^) , 

and  z  ,  ,  z  ,  ...,  z  are  of  the  form  (x  ,y.)  where  i  >  2; 

K  +1  K  +4  k  hi  — 

here  the  x^  are  inputs  of  L  and  the  y^  cure  inputs  of  M.  By 

hypothesis,  (x^,y^)  has  access  to  at  least  n  +  k’  -  1  vertices 

of  the  form  (u, »y.)  where  the  u.  are  outputs  of  L.  Since  M  is 
j  3  3 

non-blocking,  these  have  access  to  all  idle  vertices  of  the 
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form  (Uj,v^)  where  is  an  output  of  M.  There  are  (n  +  k*  -  l)d 

such  vertices.  However,  as  many  as  (c  -  l)a  -  (k  -  k')  of  these 
could  be  busy;  this  would  be  the  case  if  all  inputs  of  the  forrr. 


Oth'yi>'  where  i  >  2,  other  than  zk,  ,  \>+2‘ 


were  busy. 


Thus  z^  has  access  to  at  least 


(n  +  k‘  -  l)d  -  (c  -  l)a  +  (k  -  k1)  >  nd  -  (c  -  l)a  +  k  -  l 

*  n*  +  k  -  1 


output  terminals  of  L  x  M. 

Hie  following  theorem  provides  the  motivation  for  defining 
the  notion  T(m,n) . 


Theorem  4.2:  Suppose  M  is  a  non-blocking  network  and  L  is  a 
network  with  a  inputs t  b  outputs t  and  of  type  T(l,n).  If 
2n  >  b,  then  lLtMsL'}  is  non-blocking. 


The  proof  is  similar  to  that  of  Theorem  3.1  and  will  be 
omitted. 

Now  choose  an  integer  k>  1  and  put  L.  =  C  x  C;?  ; 

ZK  Z  t  Z 

if  1  <_  j  <_  k,  then  L.  has  2^  inputs,  k*2^  outputs,  and  induc- 

^  i-1 

tively  by  Lemma  4.1,  L^  is  of  type  T(l,2  (2k  -  j))  and 

T(2,2^  1(2k  -  j)  +1).  Thus  L.  is  of  type  T(2,k2k  1  +  1) .  Let 

*  k 

M  be  obtained  from  L.  by  omitting  one  input.  Then  M,  has  2-1 
k*  k  ^  k-1 

inputs  k*2  outputs,  is  of  type  T(l,k*2  +1),  and  its  graph 

has  no  more  edges  than  the  graph  of  Lu  .  The  associated  matrix  of 

\is 


(2  0\  /2  0Vk~l  yj  i  0  \ 

4k  2k/ \4  2 !  \2k2  k/ 


k  2 

Thus  has  <_  2  *2k  edges  and  if  N  is  any  non-blocking  network, 

then  by  Theorem  4.2,  so  is  [M.  ,N,MD .  Thus  putting,  for  example, 

*  k  k+1 

N  =  C  ,  we  obtain  a  non-blocking  network  with  (2  -  2)  inputs 

2,2  k+i  2 

and  outputs  whose  graph  has  <_  2  (4k  +  2k)  edges.  It  is  imme- 

2 

diate  that  o(N,N)  <_  8N(log2  N)  for  all  N  >_  2.  It  is  not  hard  to 

see  that  the  constant  8  could  be  considerably  decreased,  but  the 
major  open  question  is  the  value  of  the  exponent. 
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